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Abstract: We discuss various analytical approximation methods for following 
the evolution of cosmological density perturbations into the strong (i.e. non- 
linear) clustering regime. We start by giving a thorough treatment of linear 
gravitational instability in cosmological models and discussing the statistics of 
primordial density fluctuations produced in various scenarios of structure for- 
mation, and the role of non-baryonic dark matter. We critically review various 
methods for dealing with the non-linear evolution of density inhomogeneities, 
in the context of theories of the formation of galaxies and large-scale struc- 
ture. These methods can be classified into five types: (i) simple extrapolations 
from linear theory, such as the high-peak model and the lognormal model; 
(ii) dynamical approximations, including the Zel'dovich approximation and its 
extensions; (iii) non-linear models based on purely geometric considerations, 
of which the main example is the Voronoi model; (iv) statistical solutions in- 
volving scaling arguments, such as the hierarchical closure ansatz for BBGKY, 
fractal models and the thermodynamic model of Saslaw; (v) numerical tech- 
niques based on particles and/or hydrodynamics. We compare the results of 
full dynamical evolution using particle codes and the various other approxima- 
tion schemes. To put the models we discuss into perspective, we give a brief 
review of the observed properties of galaxy clustering and the statistical meth- 
ods used to quantify it, such as correlation functions, power spectra, topology 
and spanning trees. 
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1. Introduction 



The last two decades have been an exciting period for astrophysicists working 
on the problem of the origin of the large-scale structure of the Universe. Over 
this period, as a result of a continual interplay between theory and observation, 
a picture has emerged within which most of the properties of galaxy clustering 
can be understood. 

In this picture the Universe is almost homogeneous in the early stages, but has 
some small amplitude density inhomogeneities with a characteristic spectrum. 
The theory of inflation can offer explanations for both the initial almost- 
smoothness and the small density perturbations. As the Universe evolves, 
these perturbations grow by gravitational instability, at first linearly (so that 
the evolution of the different spatial Fourier modes are independent and the 
perturbations have a small amplitude), but eventually forming non-linear con- 
centrations of mass (where the Fourier modes are coupled together and the 
density perturbations are large); these non-linear concentrations are the even- 
tual sites for galaxy and cluster formation. An important ingredient in this 
picture is the (postulated) existence of large quantities of non-baryonic dark 
matter which allow fluctuations to begin to grow even in the early phases 
of the evolution of the Universe when radiation pressure inhibits the growth 
of structure in the baryonic material. This general idea has some empirical 
confirmation from both galaxy clustering data and, most recently, by the dis- 
covery by the COBE satellite of small anisotropics in the Cosmic Microwave 
Background radiation, which may be the signature of primordial density per- 
turbations. 

Successful though this basic picture has been, it has to be admitted that at 
present there is no model of galaxy and structure formation that can account 
for all the observational data. Models with different kinds of dark matter, 
different initial fluctuation spectra, different relationships between luminous 
material and dark matter have all been suggested at various times but none 
has been successful enough to deserve the status of a "standard model" of 
galaxy formation. 

The most fundamental problem one must tackle in a theory of the origin of 
large-scale structure is that structures around us in the Universe today cor- 
respond to density fluctuations many orders of magnitude greater than the 
mean density of the Universe. A galaxy cluster, for example, will have a den- 
sity around a factor 1000 greater than the mean density of cosmic material. 
This means that such structures are non-linear objects. While the growth of 
density fluctuations in an expanding Universe is tractible analytically if the 
fluctuations are small (using linear perturbation theory), there is no general 
exact solution for the non-linear regime. Traditionally, astrophysicists have 
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therefore resorted to numerical iV-body methods to understand this latter 
regime. Alternatively, one can try to look to scales large enough so that den- 
sity fluctuations are small enough that linear theory is still applicable. The 
problem with the former approach is that such methods are extremely time- 
consuming and can only be used to look at a small part of the parameter space 
of possible models. The latter approach has two weaknesses. One is that it is 
extremely hard to obtain direct empirical information about galaxy clustering 
on such large scales (and galaxy clustering need not give us direct information 
about the mass distribution). The second problem is that we actually want 
to understand the formation of non-linear objects, not just the super-large 
structures which are still evolving more-or-less linearly. 

What has been missing is a good theoretical understanding of the non-linear 
regime of gravitational clustering. Being able to produce a particular clustering 
property in a numerical simulation is not the same as understanding it! 

In recent years, however, there has been a great deal of interest in ana- 
lytical methods for studying this difficult problem. Various approximation 
schemes have been suggested which can be applied to study different aspects 
of the growth of non-linearity in the cosmological mass distribution. These 
range from straightforward perturbative extensions of linear theory, to simpli- 
fied kinematical models, scaling solutions and arguments based on statistical 
mechanics. The description of galaxy clustering is essentially statistical, but 
its origin is dynamical. Different aspects of the clustering pattern are there- 
fore reproduced with differing degrees of accuracy by different approximation 
schemes. 

Our aim in this review is to give a thorough treatment of the most important of 
these approximation schemes, explaining their physical motivation and, where 
appropriate, suggesting situations where they might be sufficiently accurate 
to be useful. We have tried to make the paper as self-contained as possible, 
so we have included a fairly thorough grounding in linear perturbation theory 
before introducing the non-linear approximations. To put these methods in 
context, we have also included a discussion of numerical simulation methods 
and some aspects of the statistical analysis of galaxy clustering. We do, how- 
ever, assume basic knowledge of physical cosmology such as can be obtained 
from P^ , [447|J579|| . 



2. Cosmological Perturbations and Linear Evolution 
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2.1. Field Equations for the Unperturbed Universe. 



The Universe is generally believed to be homogeneous and isotropic on scales 
greater than several hundred Mpc. This assumption is largely based upon the 
observed homogeneity in the large scale distribution of galaxies, radio galaxies 
and QSO's. In addition, the extremely small degree of anisotropy of the Cosmic 
Microwave Background Radiation (CMBR) (which travels to us virtually un- 
hindered from the surface of last scattering located at a cosmological redshift 
of ~ 1100) indicates that the Universe was extremely isotropic (to one part in 
~ 10 5 ) at the time when matter and radiation decoupled. This suggests that 
our basic starting point[] for the study of the origin of inhomogeneity should 
be an isotropic and homogeneous background model. 



2.1.1. FRW Models 

The Friedmann-Robertson- Walker line element describes a homogeneous and 
isotropic Universe^ 

ds i = d e-a 2 {t){^^+r 2 de 2 + r 2 sin 2 6dA « = 0, ± 1 (1) 

\ 1 — KT J 

where a(t) is the cosmic scale factor. The dynamics of a FRW Universe is 
determined by the Einstein field equations: G v = 8nGT^ where is the 
energy-momentum tensor describing the contents of the Universe. For a per- 
fect fluid p = Tq , P = — T£ and the Einstein equations simplify considerably 
to 

87rGp + A 



a J a? 

d AtxG . „„. A 

- = ^-(P + 3P) + - 

a 6 6 

P = -3-(p + P). (2) 
a 

The cosmological constant A increases the acceleration of the Universe com- 
pared to models containing only matter and/or radiation. It has been intro- 
duced on several occasions into the theory in attempts to make cosmological 
models more compatible with observations. A central issue, presently unre- 
solved in cosmology, is the current value of the Hubble parameter and the 
matter density. Although the density of the Cosmic Microwave Background 

1 We should mention that, although considerable effort has been directed towards the study 
of perturbations, rather less effort has been expended on the subject of how to fit a back- 



ground model to an observed lumpy Universe in an optimal way 1 184 
2 We set the velocity of light c = 1 for simplicity. 
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radiation is known to a high degree of accuracy to be p r = 4.7 x 10~ 34 g cm -3 , 
the density of matter is known only to within an order of magnitude, the un- 
certainty in p being usually expressed in terms of the dimensionless density 
parameter 

Present observations limit Q to lie within the range 0.02 < Q < few. The value 
of the Hubble parameter is known to within about a a factor of two to be 0.4 < 
h < 1 where h is the dimensionless Hubble parameter expressed in units of 100 
km s _1 Mpc" 1 . The uncertainties in H and Q are of course related, because 
of the definition of Q (|3|). The value of the cosmological constant is equally 
uncertain, being loosely constrained by the inequality |Oa| — \A\/3H 2 < 0.6 
with 90% confidence 



From ([!]) & (HD we can obtain analytical forms for the scale factor for some 
physically interesting cases. 



2.1.2. Spatially Flat Universe with One Matter Component 

In this case we have O = 1 and the scale factor evolves according to 



i 



for a perfect fluid with 



P 



3{l + w) 
P 

w = — 
P 

3p- 1 

v = — ; r 

2(p-l) 

and r) is the conformal time coordinate 



a{t) 



Some special instances of eq. (01) are: 



(i) pressureless matter or dust (w — 0): a(t) oc t 2 / 3 , a(rj) oc rj 2 

(ii) radiation (w = |) : a(t) oc a{rj) oc 77. 



(5) 



/ dt 

V = -77s- (6) 
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2.1.3. Spatially Flat Models with Two Non-interacting Fluids 
Here the interesting solutions are: 

(i) dust + radiation: 0(77) oc 77 (77 + 770), 



(ii) radiation + stiff matter (w = 1): 0(77) oc yr] (77 + 770), 

(iii) dust + cosmological constant: a(t) oc (sinh|y^|ct) 



2.1-4- Spatially Curved Models 

In this case we divide into open models (O < 1), where we have 

(i) Dust: 0,(77) = A(cosli77 — 1), ct = A(smhi] — 77) 

(ii) Radiation: 0.(77) = v4'sinli77, ct = v4'(cosli77 — 1), 

while for closed modes (Q > 1) we have 

(i) Dust: 0(77) = A(l — cosrj), ct = A(r) — sin 77) 

(ii) Radiation: 0(77) = A' sin 77, ct = A'(l — cos 77). 

2.1.5. Other Useful Relations 

It is sometimes convenient to express the Hubble parameter and the FRW 
equations in terms of the cosmological redshift parameter 

a(t) 

In terms of this new parametrisation, the Hubble parameter in a multicom- 
ponent Universe consisting of several non-interacting matter species charac- 
terised by equations of state P a = w a p a is given in a very simple form: 



H(z) = H (l + z) 



For a spatially flat Universe 1 — J2 a ^« = and equation (g) reduces to 



H(z) = H (l+z) 
In a Universe consisting only of dust 



z 



,l+3u> 



(9) 



H(z) = H (l + z)(l + Q z)i (10) 
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In this case the expression for the density parameter turns out to be particu 
larly simple: 

n (i + z) 
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1 + tt Z 

for large values of flz » 1, Q(z) — > 1, indicating that the density always 
approaches the critical value at early times. 

A useful relationship between the cosmological time parameter t and the cos- 
mological redshift z can be obtained by differentiating 1 + z = ao/a(t) with 
respect to time so that dz/dt = —H(z)(l + z). For a Universe consisting only 
of dust this leads to 



#0*0) = / 



dz' 



Evaluating the integral in eq. 

i 



t(z) 

t(z) 
t(z) 



Hp 1 

i - a 







7T 



^o-12 



2tf ~ 1 



l + z 

^0 



+ 



{1 + Z') 2 {1+Q Z')2 

we get [ p79(| : 



'121 



2(i-n )5 

^0 

(n -i)2 (n -i)2 



-log 



1+flo*) 5 -(l-O )2 



i + cv) 2 + (i -fi ) 2 



arctan 



i + n z i 

-12 



^0 



n ^) 2 



1 + 2 



3(1 + 2)2 

for the cases f^o < 1, > 1 and Qo — 1, respectively 

For a spatially flat Universe with a cosmological constant (Qa 
present age of the Universe is given by ||318|| : 



(13) 



£ n = 6.5/T 1 



log 



l + VT^n 



In the limit Q — > 1 we recover to = 6.5 h _1 Gyr. 



x 10 yr. 



Q) the 



(14) 



2.2. Gravitational Instability: Linear Theory 



Having briefly described some properties of the evolution equations governing 
the expansion of a homogeneous and isotropic Universe, let us now consider 
small departures from homogeneity and isotropy which eventually grow to 
become galaxies and other cosmic structures. The line element which appro- 
priately describes the perturbed FRW Universe in the synchronous gauge, the 
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most convenient gauge choice for many purposes, has the form 



ds 2 = ds 2 FRW + h a pdx a dx 13 , (15) 

where a,j3 = 1,2,3, and ds 2 FRW is the FRW line element given by eq. ([!]). 
In the linearised approximation which we shall follow, perturbations of the 
metric are assumed to be small h a ph af3 << 1, and the length scale of the 
perturbations is always much smaller than the Horizon scale Xh — 2ct so that 
a Newtonian treatment of the subject is valid. The relativistic generalisation 
of the linearised treatment of gravitational instability presented here has been 
developed in several papers BgBl , p5|j522| , |182| , |26o1 ,fe|. 



If the mean free path of a particle is small, matter can be treated as an 
ideal fluid and the Newtonian equations governing the motion of gravitating 
collisionless particles in an expanding Universe can be written in terms of 
x = f/a (the comoving spatial coordinate), v = f — Hr = ax (the peculiar 
velocity field), 4>(x,t) (the peculiar Newtonian gravitational potential) and 
p(x,t) (the matter density). In this manner we obtain the following set of 
equations 



First, the Euler equation, 

d(av) / * \ 1 - 2 . 

J_2 + . Vx )v = — V,P - V,0 16 
at p 

The second term on the right hand side of eq. ( |I6"D is the peculiar gravitational 
force responsible for the acceleration of the fluid element, which we shall denote 
g = — V x (j)/a. If the velocity flow is irrotational, v can be rewritten in terms 
of a velocity potential (fi v : v = —V<fi v /a. For a polytropic fluid with equation 
of state P oc p 1 the Euler equation can be integrated to give the Bernoulli 
equation 

1 ^ , X2 7 P 



(V x 0„) 2 = ^-- + 0. (17) 



dt id 2 v 7-1 p 

Next we have the continuity equation: 

% + 3Hp + -V x (pv) = 0, (18) 
dt a 

or, equivalently, 

* + + „«, = „. (19) 

And, finally, the Poisson equation: 

= AnGa 2 (p - p ) = 4:7rGa 2 p 5, (20) 

where p is the mean background density and 5 = p/p — 1 is the density 
contrast. 
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Expanding p, v and perturbatively and keeping only the first-order terms 
in equations (0), ( fL8f) an d ( ^0|) gives the linearised continuity equation: 



— = — V x ■ v or, equivalently, 5 = — - (v x • v) . (21) 

at a an f v ' 

The function / is given by 

dlogtf . . n o.6 , fi A f, . 



J rfloga 70 V 2 7 V ; 

where f^A = A/3Hq is the effective fraction of the critical energy density 
contributed by a cosmological constant term ||33C , 440 . 444 , 365 ] . 

The linearised Euler and Poisson equations are as follows: 

dv a _ 1 -> 1 -± . . 

— + -v = V x P-~V x <f> 23 

at a pa a 

V 2 = A7rGa 2 p 5 (24) 
(\v\, 1 0|, |d>| << 1 in equations (^j])- (H))- From these equations we can obtain 
a second order differential equation for the linearised density contrast S 

d 2 5 85 V 2 P 

_ + 2 tf__4^=-^ (25) 

Ignoring pressure forces for the moment we obtain 

5 + 2H5 - AttG Po 5 = 0, (26) 
which can also be rewritten as 

6 + 2H5 - ^QH 2 5 = 0. (27) 

For a spatially flat matter-dominated Universe po = {QiTGt 2 )^ 1 , and eq. (|26| ) 
admits two linearly independent power law solutions S(x, t) = D±(t)5(x) 
where D + (t) oc a(t) oc t» is the growing mode and -D_(£) oc t^ 1 is the decaying 
mode. For initial conditions demanding 5 = 8i and 8i — at t = t\ (i.e. the 
perturbation begins to grow from rest), the final form of 5 is a superposition 
of increasing and decreasing modes: 

w^M^+Md' 1 ' (28) 

i.e. three fifths of the initial amplitude is in the increasing mode and the 
remaining two fifths in the decreasing mode. Substituting 5(x,t) = D + (t)S(x) 
in eq. ([24]) we obtain the linearised Poisson equation 

VaV = ^GD + (t)5(x )p a 2 (29) 
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In a spatially flat, matter-dominated Universe: po oc a , D + (t) cx a(t), and 
-> 2 

we find that V x oc i.e. the linearised gravitational potential does not 
evolve with time. 



From equations ([21|)-(|24]), ignoring pressure, we get 



2f -* , const . , 

— . (30) 



3QHa XY a(t) 

Equation (|30|) demonstrates that the velocity flow associated with the grow- 
ing mode in the linear regime is potential: v = —V<j) v /a where cj) v = A<p, A = 
2f /3QH, a fact that is used to great advantage in non-linear approximations 
of gravitational clustering such as the Zeldovich approximation (§4.2) and in 
reconstruction methods such as POTENT (§8.10). In the absence of a gravi- 
tational force field g = —V x <j)/a = 0, so the peculiar velocity in an expanding 
Universe decays kinematically as v cx a(t) _1 ||440| . 



Since 



g = = Gp a(t) / rfV \ JK (31) 

a J \x — xr 



we find that the peculiar gravitational acceleration associated with a given 
mode is simply 

g± cx p aD±(t). (32) 



From equations (^) and ( |32|) we find that the peculiar velocity and the pe- 
culiar acceleration are parallel to one another, and that in a flat matter- 
dominated Universe, the velocities associated with the increasing and decreas- 

1 4 

ing modes respectively are v + cx ts and V- cx t~3. An expression similar to 
equation (|30|) can also be derived for an isolated spherical overdensity (§3.1). 

The linearised equations of motion provide an excellent description of gravita- 
tional instability at very early times when density fluctuations are still small 
(5 1). Linear theory may also provide a qualitatively good description of 
large scale clustering in hierarchical scenarios (such as the cold dark matter 
model which will be considered shortly) in which large scale power is sup- 
pressed relative to small scale power in the fluctuation spectrum leading to 
5 » 1 on scales < 1 Mpc, while 5 < 1 on scales greater than 10 — 30 Mpc. 

For spatially open and closed FRW models it is convenient to find general so- 
lutions of eq. ( ^7|) by noting that the decreasing mode is simply proportional 
to the Hubble parameter |[254| , |579|| . To see this, differentiate the Einstein equa- 
tion H+H 2 = — 4:7iG(p— 2A)/3, and use the continuity condition p = —3Hp , 
leading to H + 2HH — AnGpH = 0. This shows that D^it) cx H is indeed a 
solution to eq. (p6|). The second solution of eq. (|26|) can be constructed from 
D_(t) and the Wronskian of eq. @: W{t) = D + D_ - D_D + cx a~ 2 . This 



10 



gives 



D+(t) = D-(t) J W(t')DZ 2 (t')dt' = H(t) J 



dt' 



a 2 (t')H 2 (t') 



Hfa) 



da 



(Ha) 



Substituting a(z) = ao(l + z) 1 with H(z) denned in ([10|) we obtain 

dz' 



a J 



Equation 



,# r 2 (i+^)(i+^v)2" J 

can be integrated to give 



+ z') 2 (l + fi z / ) 3/2 ' 



(33) 



(34) 



= H FTTo + 3fi o — -s fi^o, z) 



II -finl 2 



where 



f(fto, z) = arctan 



{i + n z)l + (i_fl )2 
1 + ^)1 -d-On)^ 



(35) 



1 + tt z 

n -i 



(36) 



for the cases f2 < 1 and fig > 1, respectively. In the simpler case where 
Q = 1, we have 

D+(z) oc (1 + 2)- 1 (37) 
A good approximation to D + (z) for Q < 1 can be provided by the fitting 



formula 579 



DJz) 



l + ^o 



3 5 N _1 
1 + -flo + 



(3* 



From equations ( |3"5D & fl3"E|), we find that, for z » Qq 1 , D + oc z~ l - a 
consequence of the fact that the Universe at early times expands at the critical 
rate a oc t 2 ^ 3 regardless of the value of Qq. For open models we find that, 
at late times (corresponding to z « ^o" 1 ), the growth of D + freezes to a 
constant value due to the rapid expansion of the Universe a oc t, caused by 
the dominance of the curvature term in the equations of motion (Q). The 
associated velocity perturbation first grows according to (1 + z)~ l l 2 and then 
decreases as 1 + z. 

Although the decreasing mode can be important in some circumstances, we 
shall hereafter mainly deal only with the increasing mode because this is re- 
sponsible for the formation of cosmic structure in the gravitational instability 
picture. 
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In the preceding discussion we limited our attention to gravitational instabil- 
ity in a single component Universe. We shall now extend it to more general 
situations by considering the behavior of linearised perturbations in a Universe 
consisting of, firstly, pressureless matter (dust) and radiation, and, secondly, 
dust and a cosmological constant. In both cases we shall consider flat FRW 
models. The resulting equations in this case are 



8 + 2 (?-)<)- IxGp,,,* 

2 



8ttG 



{Prn + Px) 





8ttG 



-Ptot 



(39) 



where in the first case px = Pr oc a~ 4 , and in the second px = A/8nG = 
constant. We have here assumed that the radiation component is unperturbed 
which is generally not exact, but this calculation serves to illustrate a qualita- 
tive point. From eq. (^) it is clear that the presence of an additional smooth 
component will, by increasing the rate of expansion of the Universe, inhibit 
the growth of 5 |[240| , [393|]23CI|j472|| . This effect can be quantitatively demon- 
strated by solving eq. (|39|) with p x = p r which after a change of variables 
Pm/ p r oc a{t) becomes [|393| , [230| , |l69 



T 



d 2 5 (2 + 3r) d5 _ 3 
d^ + 2t(1 + t) d7 ~ 2r(l + r)' 



0. 



(40) 



Equation (|40f) has the growing solution D + oc 1 + 3r/2, which demonstrates 
that fluctuations remain frozen so long as the Universe is radiation dominated 
(r << 1), and begin to grow only after matter-domination (r > 1). 



A similar effect (but in reverse chronological order) occurs for a Universe 
with a cosmological constant. The decaying mode in this case is given by 
D_(a) oc H(a), with H(a) given by (0), D + (a) is given by (|33f) . For a 
spatially flat Universe with + Q m = 1: 



D(a) oc a 



1 + ?T a 



D + (a) = A 5 - B J 5 -, 2 -) 



(41) 



where 



[l + z)- 



(42) 



A is an arbitrary constant, B x (a,f3) is the incomplete Beta function [j52| . As 



in open models, the growth of density perturbations slows down considerably 



during later epochs z < z* = (Qa/^. 



,1/3 



1, when the expansion of the 



Universe begins to be driven by the cosmological constant 



. At very 
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late times D + (a) approaches a constant value 



ss.w-^&DiW- (43 » 



In closed models of the Universe the presence of a cosmological constant leads 
to additional interesting possibilities including an intermediate loitering or 
coasting stage which arises because of the mutual cancellation of terms involv- 
ing matter, the curvature term and the cosmological constant in equation (H) 
leading to an epoch when the scale factor does not change significantly with 
time: a(t) ~ constant |[451| , |475|| . Such models are interesting because during 
loitering density perturbations grow very rapidly: 

D+(t) oc -exp jy ^TrGpdtj , (44) 

approaching the exponential Jeans rate which arises in a nonexpanding Uni- 
verse. In addition to growing large amplitude inhomogeneities from small ini- 
tial conditions, a loitering Universe can also be very old, much older than Hq 1 . 
This might ameliorate the age problem which arises in a flat matter-dominated 
FRW Universe having t = 2H 1 /3, which for Hq 1 = 1 - 2 x 10 10 years is 
somewhat smaller than the estimated age of globular clusters, 1.6 x 10 10 years 
[ 2 75 1 . As shown in [475] , loitering is not confined to cosmological models with 



a cosmological constant, but in fact can arise in all closed FRW models in 
which in addition to normal matter there exists a form of matter whose equa- 
tion of state violates the strong energy condition so that w = P/p < —1/3 
(w = —1 for a cosmological constant). The loitering epoch z\ can be related 
to the density parameter of normal matter Qq, and the equation of state w by 

|1 + 3w\ 

^ = (l + z^ + Swz.-l (45) 

For instance if O = 0.1 then loitering must occur between 2 < z\ < 7.2. Other 
features of a loitering Universe are discussed in ||475|,|167|,|190|,f440|,|517|| . 



2.3. Pressure Effects; the Jeans Length 



In our treatment of density inhomogeneities we have so far ignored the effect 
of pressure terms, choosing to work with eq. ([ 
matter pressure into the analysis we recover 



rather than (|25|). Including 



d 2 S nIJ dS 



V X 2 S - 47iG Po 6 = 0. 



(46) 
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Expanding S in a Fourier series S(x,t) = Y^k^kGxp(ikx) we obtain 

5 k + 2H5 k + uo 2 k 5 k = 

- AnGp = ul (47) 




where k = \k\ = 2ira/\ is the comoving wavenumber and v s = J dP/dp is the 
speed of sound. 

Equation (f47|) is effectively a damped oscillator equation, the damping being 
caused by the expansion of the Universe. The condition ul = defines the 
so-called Jeans wavelength: 



i 



7T 



A,=^-j 2 . (48) 

The mass contained within a sphere with diameter Aj is the associated Jeans 
mass. Perturbations on scales A > Aj remain unaffected by pressure forces 
and continue to grow in accordance with the relations discussed in §2.3. On 
the other hand, small scale perturbations feel the effects of pressure gradients 
and oscillate as acoustic waves with a steadily decreasing amplitude. 

Prior to recombination, Thomson scattering couples photons and electrons 
together, as a result of which the two components matter and radiation behave 
effectively as a single fluid in which the adiabatic speed of sound is |169 



( ^ElL 



V3 \4p 7 



\ -1/2 

+ 1 . (49) 



Substituting p r = 3/ \32nGt 2 ) and v s ~ c/a/3 in eq. ([48]), we obtain Aj = act 
where a = 4v / 2tt/3. As a result we find that the Jeans length scales with the 
cosmological horizon during the radiation dominated epoch. The associated 
Jeans mass at recombination has the value 

Mj~9x I0 w (tth 2 )- 2 M Q (50) 

which is similar to the estimated masses of superclusters of galaxies. 

After recombination is complete photons cease coupling to electrons (which 
are no longer free) so that the pressure support is now provided by neutral 
hydrogen atoms instead of by radiation. As a result the speed of sound drops 
abruptly to v s ~ (5/csT/3m p ) 1//2 (m p being the proton mass and ks is Boltz- 
mann's constant), the corresponding value of the Jeans mass falls by over ten 
orders of magnitude to ||169| 



Mj ~ 1.3 x 10 5 (nh 2 )'^M Q (51) 
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which is similar to the mass of a globular cluster. 

On scales smaller than the Jeans scale dissipative processes define yet another 
length scale below which the amplitude of density perturbations is exponen- 
tially damped. This scale (set by the so-called Silk length A s ) arises because 
although Thomson scattering couples photons and baryons together before 
recombination, this coupling weakens as recombination is approached (due to 
the increasing mean free path of the photon) leading photons to leak out from 
overdense regions and into underdense regions carrying the still tightly coupled 
electrons with them. This leads to an exponential damping of inhomogeneities 
in the photon-baryonic plasma on scales A < A s , where X s is the Silk length 
at recombination, the associated Silk mass is [|169|| 

M s ~ 1.3 x l0 12 (tth 2 )- 3/2 M Q (52) 



2.4- Statistics of Primordial Density Fluctuations 



So far we have talked only about the evolution of a single Fourier mode of 
the density field 6(x,t) = D + (t)8(x). In the general case, however, the density 
field will consist of a stochastic superposition of such modes with different 
amplitudes. Writing the Fourier transform of S(x) as 

which has inverse 

5(x) = J d 3 ke i£2 5(k), (54) 

it is useful to specify the properties of 5 in terms of 6. Recall the Dirac delta- 
function: 

5 D (k) = ^J d 3 xe^. (55) 

In terms of this function, we can define the power-spectrum of the field to be 
the autocovariance of S(k): 

(5(h)5(k 2 )) = P(k 1 )5 D (k 1 + k 2 ). (56) 

The presence of the delta-function in this definition takes account of the sym- 
metry and reality requirements for P(k) [f43"| . The analogous quantity in real 
space is called the two-point correlation function or, more correctly, the au- 
tocovariance function: 

(5(f 1 ) ( 5(f 2 ))=e(|x 1 -f 2 |)=e(r), (57) 

which is related to the power spectrum by the Wiener-Khintchin relations 
which are discussed in §8.4. Both the power spectrum and the covariance 
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function of the present-day density field can be estimated by calculating anal- 
ogous quantities for discrete galaxy counts, as described in §8.5. 

The power spectrum or, equivalently, the autocovariance function, of the den- 
sity field is a particularly important statistic because it provides a complete 
statistical characterisation of the density field for a particular kind of stochas- 
tic field: a Gaussian Random Field. This class of field is the generic prediction 
of inflationary models where the density fluctuations are generated by quan- 
tum fluctuations in a scalar field during the inflationary phaseQ. We shall 
assume for virtually all of this paper that the primordial 5(x) is a Gaussian 
random field. Formally, this means that the Fourier modes have a Rayleigh 
distribution of amplitudes and the phases are independent and uniformly ran- 
dom on [0,27r]. Writing 

S(k) = \S k \ exp(i(p k ) (58) 

gives 

dP(\5 k \) = ^exp (-^Pj 141 d|5 k |, (59) 

where a 2 oc P(k). All the joint probability distributions of S(xi) . . . 6(x n ) are 
n-variate Gaussian distributions for this type of field and this property makes 
many properties of the spatial distribution of density fluctuations calculable 
analytically |T6Tl,|6|, |286| , p2l p3| . 




Many other physically-interesting properties can be related to the power spec- 
trum. For example, consider the mean square density fluctuation^ 

= (5(x)8(x)) = An J P(k)k 2 dk. (60) 
o 

This definition of the mean-square fluctuation takes into account contributions 
on all scales and may, depending on the form of P(k), actually diverge. A more 
physically realistic quantity to use is the mean square fluctuation smoothed 
on a given scale R, which can be shown to be 

a 2 (R) = J d 3 kW 2 (kR)P(k), (61) 

where W(x) is the Fourier transform of the filter function used to smooth the 
density field in real space, for example, a spherical top-hat function in real 
space gives 

3 

W(x) = — (sinx — xcosx); (62) 
x 6 

3 Gaussian random processes also occur generically in linear physics due to the central limit 
theorem. 

4 Both the spatial average and the ensamble average are equivalent for a Gaussian random 
field, which is said to be ergodic. 
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whereas a Gaussian real-space filter function leads to a Gaussian Fourier do- 
main function (which is a useful property) .0 In the linear regime, the density 
field retains its initial Gaussian character, so that 



a R (t)V27i 



°R 



2a 2 R (t) 



(63) 



For a Gaussian distribution all moments of order n > 2 are either zero (odd) or 
can be expressed in terms of a 2 (even). The cumulants, or connected moments 
are zero for n > 2 for such a distribution, §8.3. 



Furthermore, assuming the linearised continuity equation and Poisson equa- 
tions are valid, one can calculate such quantities as the mean square velocity 
smoothed on a scale R and the mean square gravitational potential fluctua- 
tion. It is useful to express these, and the mass fluctuation, in terms of the 
contribution from a logarithmic interval in fc-space: 

da 2 

47rk 3 P(k); 



dlogk 
dcr 2 

dff * -4,fW a A 2 ^. (64) 



dlogk V2 
We discuss the use of velocity statistics in §8.10. 

The general picture that emerges is that, while the amplitude of each Fourier 
mode remains small, i.e. S(k) <C 1, linear theory applies. In this regime, each 
Fourier mode evolves independently and the power-spectrum therefore just 
scales as P(k,t) = P(k,ti)D+(k,t)/D+(k,ti). Even if 5 > 1 for large k, one 
can still apply linear theory on large scales by applying a filter as discussed 
above to remove the non-linear scales. Generally, it is assumed that a scale 
evolves linearly as long as <J 2 (R) < 1, though as we shall see this is not always 
correct. For scales larger than the Jeans length, this means that the shape of 
the power spectrum is preserved during linear evolution. Only when scales go 
non-linear does the shape of the spectrum begin to change. Of course, scales 
below the Jeans length do not grow relative to those above and so the shape of 
the spectrum is distorted even in the linear phase. In the popular models with 
non-baryonic dark matter, and models which have significant Silk damping 
there are other effects suppressing the growth of fluctuations in the linear 
regime. 

To complete the picture we need to specify the shape of some initial fluctuation 
spectrum which exists at some arbitrary time before it can be distorted by 

5 Useful mathematical properties of the top-hat window function are discussed in [pa . 
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damping processes. The inflationary scenario often produces a power-law form 



P(k) = Ak n , (65) 
with many models having the Harrison-Zel'dovich form n = 1. Even if infla- 



tion is not the origin of density fluctuations, the form fl65|) is a useful phe- 
nomenological model for the fluctuation spectrum. Given the current plethora 
of Inflationary Universe models, the value of the primordial amplitude A is 
usually determined empirically [ |324| . 



First attempts to normalise P(k) used counts of galaxies. Because one wants 
to use linear theory to normalise the spectrum - and thus avoid any physical 
process which distorts the spectrum's shape, rather than its amplitude - it is 
necessary to measure fluctuations on rather large scales. A useful way to do 
this is using J 3 , which is defined by 

? p3 

J 3 (R) = J £(r)r 2 dr = —J d 3 kW 2 (kR)P(k) (66) 
o 

where W(x) is defined by eq. fl62|). Alternatively, one can measure the vari- 
ance of galaxy counts-in-cells on a scale R, using eq. (|6l"l). These methods are 
discussed in §8.5. The problem with such attempts to normalise the spectrum 
are two-fold. First, there is the problem that redshift surveys probe only a 
comparatively small volume of space and we need large scales for linear the- 
ory to be reliable. Secondly, and more fundamental, is the problem that we 
do not know how to relate galaxy counts and mass fluctuations in an unam- 
biguous way. This ignorance is usually parametrised by the phenomenological 
relationship 

(£)-(?)• 

but the bias factoiQ b may well be scale-dependent p86|ri57|j290| , p2| ,p3i 



(67) 



The fact that the velocity flow induced by density perturbations is sensitive 
to the total mass fluctuation, not just the part that is in galaxies, suggests 
that one should use statistics of the velocity field to normalise the spectrum. 
This idea is discussed in §8.1.5 & §8.10. The problem with this is that our 
knowledge of the detailed statistics of the velocity field is rather poor. 

6 Although we shall deal extensively with the origin of structure in this paper, we shall not 
deal with the even harder problem of how individual galaxies form. Quite apart from the 
need to accumulate enough mass to form a galactic halo, one also needs to understand star 
formation, hydrodynamical processes during the protogalactic collapse and the possible role 
of environmental feedback processes on the forming object. All this physics is encapsulated 
in the bias parameter b. That an attempt should be made to embody such complicated 
effects in a single parameter is ample evidence that we are not close to a theory of galaxy 
formation. 
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As we shall see in §2.7, the COBE satellite's detection of CMBR anisotropy 
has allowed us to circumvent these difficulties and obtain a relatively model- 
independent normalisation of the power spectrum appropriate for models of 
structure formation. Before we discuss this, however, we need to look in more 
detail at models of structure formation involving non-baryonic dark matter. 

2.5. Non-Baryonic Dark Matter 

We would now like to describe currently popular cosmological models in which 
the matter content of the Universe is usually considered to be in the form of 
two non-interacting fluids. The first is presumed to be dissipationless and non- 
baryonic whereas the second is made up of baryons (WIMPs such as massive 
neutrinos and SUSY particles, or axions, are possible candidates for nonbary- 
onic matterTJ, whereas the baryonic material is presumed to be composed 
predominantly of neutral hydrogen and helium at early times). The need to 
incorporate non-baryonic forms of matter into the standard big bang model 
arises because of the necessity to generate large enough perturbations to ac- 
count for the presence of galaxies, from sufficiently small initial values of the 
density contrast 5p/p, without violating the CMBR constraints on small, i.e. 
arc minute angular scales. This is a difficult task in baryonic models in which 
perturbation growth takes place only after the cosmological recombination 
of hydrogen, which occurs at redshifts ~ 1100. Before recombination, radia- 
tion pressure caused by Thomson scattering effectively prevents the growth of 
all perturbations having wavelenths smaller than ~ 180/i -1 Mpc, the horizon 
size at recombination. After recombination, density perturbations grow lin- 
early with the scale factor of the Universe, a(t), until the Universe becomes 
dominated by curvature or by a cosmological constant, after which time per- 
turbation growth is strongly suppressed (if Sl < 1)- 

The maximum permitted growth factor for perturbations is therefore 5/5 rec < 
«o/ a rec — 1100 (for ft < 1 ). Consequently, the requirement that 5 > 1 today 
- which is a necessary (but not sufficient) condition for the formation of gravi- 
tationally bound systems such as galaxies - leads to the primordial amplitude: 
5 ~ 10~ 3 at recombination. If the primordial perturbations are adiabatic (i.e. 
fluctuations for which the entropy per baryon is spatially constant), then at 
recombination 



so that a fluctuation amplitude ~ 10~ 3 in the baryon component would in- 
variably result in an anisotropy ~ 10~ 4 in the CMBR temparature on arc 
minute scales which has not been observed. On the other hand, perturbations 
in a nonbaryonic component, can begin growing soon after matter-radiation 

7 WIMP means Weakly Interacting Massive Particle. 
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equality has been achieved (i.e. by z ~ 10 4 in the case of an Q = 1 Universe) 
with the result that large final perturbations can develop from small initial 
values without violating the CMBR constraints on small scales [|443|,|507|J516 



The existence of non-baryonic matter is also strongly suggested by the ob- 
served density of matter in the Universe parametrised by Qq. Although the 
value of f2o is not precisely known, most observations place it in the range 
bounded by 0.1 < Q < 1 with some indications that Q could be as large as 
unity, in agreement with predictions made by the Inflationary Universe sce- 
nario |290|j414| , |l55| , |125| , |15ij| | . A value of Q greater than ~ 0.1 would require the 
presence of some form of non-baryonic matter since primordial nucleosynthesis 
imposes rather stringent constraints on the contribution to Qq from baryons: 

n b ~ o.oi6/i- 2 gm 



Gravitational instability in a two component medium with Q = 1 consisting 
of baryons and dark matter is governed by the following coupled system of 
equations (again to linear order): 

5\ + 2 (~) Si + (^\ 5i = 47rG(Mi + P2S2) 

h + 2 5 2 + 5 2 = 4ttG(M + P282) 

a\ 2 8tiG . . . , 

- =^ r (Pi+P 2 ). (69) 

If either k = 2na/X = 0, or v 1 = v 2 = then equations have the solutions 



554 



S x = B-iP + B 2 r l + B 3 + B 4 r 1/3 
8 2 = Brfi + B 2 r l + B 5 + B 6 r 1/3 
= ^Bz + Vl 2 B 5 = fix5 4 + n 2 B 6 . (70) 



It is interesting that, of the four linearly independent components in equa- 
tion (f70|)) only one corresponds to a growing mode. An important subclass 
of equations ( E^ ) & ( [70] ) arises if p\8\ » P2^2- This condition is satisfied 
in most cosmological models since the perturbation in the dark matter com- 
ponent immediately after recombination is much larger than the correspond- 
ing baryonic perturbation (pi#i — 10p 2 <5 2 ), consequently we can ignore the 
presence of p 2 8 2 terms in the right hand side of eqs. ( |69|) which leads to: 
Si oc a(t),S 2 oc 8i(l — const/a), indicating that perturbations in baryons 
rapidly fall into potential wells created by dark matter, with the result that the 
growth rate of both types of perturbations converges at late times. The behav- 
ior of perturbations in both baryonic and nonbaryonic matter after matter- 
radiation equality is illustrated in Figure ([I]). 
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Fig. 1. The growth of perturbations in baryons (solid line) and non-baryonic dark 
matter (dashed line - 'x') is plotted against the cosmological redshift parameter z 
for different values of Q;, and Q x . Adapted, with permission, from [507]. 



For other physically interesting solutions to the system of equations (|S§[), see 
228|p^TO^j4§g|j52lU5^ . For a general equation of state, solutions to 



these equations can be expressed in terms of the Meijer G-function and can 
therefore be studied by analysing the well documented properties of this func- 
tion [29|. A relativistic generalisation of equations (|69|) can also be obtained ||5 22| , |5 25 , 473 



Non-baryonic matter does not, in general, couple to photons, so that pertur- 
bations in it are not subject to collisional (Silk) damping. However depending 
upon the momentum of the non-baryonic particle, collisionless phase mix- 
ing caused primarily by the free streaming of relativistic particles as these 
travel from regions of high density into regions of low density (and vice versa) 
leads to a very effective mechanism whereby fluctuations on scales smaller 
than the free- streaming distance Xf s are wiped out; A/ s is the mean dis- 
tance traversed by a relativistic particle species until its momentum becomes 
non-relativistic. The cosmological properties of non-baryonic matter depend 
mainly upon whether Xf s is large or small. Particles for which the free-streaming 
distance Xf s is large are said to constitute Hot Dark Matter (HDM) (to un- 
derscore the relativistic nature of the particle species). In the opposite case 
when Xf s is too small to be cosmologically interesting, the particles are said 
to constitute Cold Dark Matter (CDM) 
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For both HDM and CDM the shape of the processed final spectrum of fluctua- 
tions differs from its primordial shape. In the case of HDM this arises because 
fluctuations on scales smaller than Xf s are wiped out due to free stream- 
ing so that the processed final spectrum has a well-defined cutoff on scales 
smaller than A ~ A/ s . For HDM models based on a light massive neutrino 
\f s ~ 41(30 eV/m^) Mpc, the associated mass scale for neutrinos of mass 
m u ~ 30 eV is Mf s ~ lO 15 (f2/i 2 )~ 2 M which is similar to the observed mass 
of a rich cluster of galaxies |63| , |507|| . Consequently, the first objects to col- 



lapse in such a model have the mass of a cluster, smaller objects forming later 
by the fragmentation of primordial cluster-scale "pancakes" . This scenario is 
commonly called the "top-down" scenario for galaxy formation and was orig- 
inally suggested by Zeldovich and co-workers in the 1970s in connection with 



baryonic models of galaxy formation ||507 



If either the mass of the nonbaryonic particle is very large (as in the case of 
SUSY particles such as the photino and the gravitino), or if its momentum 
is very small due to it being formed out of thermal equilibrium (e.g. the 
axion), then the free streaming distance A/ s is too small to be relevant and 
the processed spectrum has no cutoff on any cosmological scale. However, 
even in such cases, the final spectrum does show an appreciable departure 
from its primordial form Po(k) on scales smaller than X eq (= 13/f2/i 2 Mpc) 
(X eq is related to the comoving horizon scale at matter-radiation equality) 
approaching the asymptotic form P(k) oc Po(k) x A; -4 log k for k >> k eq 
||525| . [162|| . This bending of the CDM spectrum occurs on scales ~ X eq and arises 
because small scale density fluctuations (having wavelength A < Xh < X eq ) 
which enter the horizon prior to matter-radiation equality have nothing to 
gravitate towards (the free-streaming of photons having effectively wiped out 
all fluctuations in the photon component on scales < X eq ). As a result their 
growth rate slows down considerably and approaches the asymptotic form 
5 oc 1 + 3a(t)/2a eq until matter dominance (a = a eq ) ||393| , |462|| . Perturbations 



on scales greater than the cosmological horizon (A# < A < X eq ), however, 
continue to grow at the usual rate 5 oc a 2 , until they enter the horizon, resulting 
in a larger amplitude for longer wavelength perturbations and, consequently, a 
bend in the shape of P{k). Very large wavelength perturbations (A > X eq ) are 
left virtually unscathed by the radiation dominated epoch, since they enter the 
horizon after matter-radiation equality. As a result the fluctuation spectrum 
preserves its primordial form on scales > X eq . 

The combined effect of the various processes involved in changing the shape 
of the original power spectrum can be summarised in a single quantity - 
the transfer function T(k) which relates the processed power spectrum P(k) 
to its primordial form Po(k) via the transformation: P(k) = Po(k) x T 2 (k). 
Inflationary models of the Universe generically predict Po(k) oc k n , n = 1 being 
the scale invariant Harrison - Zeldovich spectrum ||219| . |575|| . The results of full 



numerical calculations of all the physical processes we have discussed can be 
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encoded in the transfer function of a particular model. The main examples are 
listed here. 



For Cold Dark Matter 



T{k) 



1 + 



(Akf 



log(l + Bk) 



(71) 



where A = 3.08^h~ 2 Mpc, B = l.83y/nh~ 2 Mpc, and k = fi re //f2 7 is the 
ratio of the energy density in relativistic particles to that in photons. The value 
of K = 1.68 corresponds to three neutrino flavours plus photons contributing 
to the total density in relativistic particles ||473| , |525|| . An alternative fitting- 
formula is: 



T(k) 



log(l + 2.34g) 
(2.34g) 



1 + 3.89g + (16. lg) 2 + 5.46g) 3 + (6.71qY 



(72) 



where q = k9^/{VLh 2 ) and 9 = k/1.68 [23|. In addition the following transfer 
function was given by Bond and Efstathiou 



2wi 



T(k) = [l + (ak + (bk)i + (cky) 



(73) 



where a = QA(VLh 2 )- 1 Mpc, b = 3.0(n/i 2 )~ 1 Mpc, c = l.l^h 2 )' 1 Mpc, 
v = 1.13. Approximations ( |71~1 , |72D correctly reproduce the small wavelength 
behaviour of the transfer function T(k) oc k~ 2 log k for k » k eq . A comparison 
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; see also [ 


442 


,55,313 


J314 
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For Hot Dark Matter we have 



T, 



HDM 



(74) 



by solving the Boltzmann equation for the collisionless neutrinos. In eq. (|74|), 
\ v = 2ir/k v = IZiVlvh 2 )- 1 Mpc = 41 (m„/30 eV) Mpc is the damping length 
in neutrino fluctuations caused by free streaming. For a single flavour neutrino 
having mass m u and temperature T u = 1.9° K, Q u h 2 = 0.31(m^/30 eV.). Sev- 
eral alternative fits to the HDM transfer function are given in Appendix G of 
including: 

T HDM ~exp[-(3.9g + 2.1g 2 )], 
where q = k/(Q u h 2 Mpc -1 ). 



(75) 



An alternative scenario is furnished if the primordial density fluctuations are 
in the isocurvature mode, rather than adiabatic. Isocurvature initial condi- 
tions are perturbations to the equation of state, rather than true curvature 
perturbations: the total energy density is constant, but the relative contri- 
bution from each fluid component varies spatially; such perturbations do not 
arise generically (as do adiabatic initial conditions, which arise as fluctuations 



23 



in the inflaton field and are therefore predicted in most Inflationary Universe 
models) but can be generated via specific mechanisms such as by an axionic 
field in the early Universe and also in a certain category of Inflationary models 



p24 |. 



The evolution of isocurvature perturbations before and after matter-radiation 
equality has been discussed in the framework of the two fluid CDM model. 
Starobinsky & Sahni |525| . |473| give the following form for the isocurvature 
transfer function: 

T(k) = (1 + Ak)~\ (76) 

where A = A.36^/K 1 h~ 2 Mpc. An alternative transfer function is given by Efs- 
tathiou & Bond JT7Tj 

T(k) = [1 + (ah + (bk)l + (c£;) 2 )T% (77) 

where a = Ih.^tth 2 )- 1 Mpc, b = O.^Qh 2 )- 1 Mpc, c = 5.8(fi/i 2 )- 1 Mpc, 
v = 1.24. An essential feature of the isocurvature transfer function is the 
absence of the logarithmic growth factor that characterises the small scale be- 
haviour of the adiabatic CDM spectrum, indicating that density perturbations 
on a range of scales (k » k eq ) grow to become non-linear virtually instan- 
taneously. Somewhat surprisingly, however, isocurvature fluctuations generate 
larger anisotropies in the CMBR than adiabatic fluctuations, for two reasons. 
Firstly, initially isocurvature fluctuations generate significant fluctuations in 
the gravitational potential when they enter the horizon; this is due to the 
influence of pressure gradients. In addition, isocurvature fluctuations generate 
significant fluctuations in the radiation density after matter-domination, be- 
cause the initial perturbation in the equation of state, which can be thought 
of as a perturbation in the entropy, is then transferred into the perturbation 
of the radiation. The upshot of all this is that the net anisotropy seen is a 
factor six larger for isocurvature fluctuations than for adiabatic ones, mak- 
ing such fluctuations uninteresting from a cosmological viewpoint [^25|J170 
though they have been discussed in the literature [p37j|314|Jl69|l. 



2.6. Normalising the Power-Spectrum 



We have already mentioned that the power spectrum could in principle be nor- 
malised using the observed clustering properties of galaxies. In practice, such 
a normalisation runs into trouble because of the absence of a strict relation- 
ship between the clustering properties of dark and luminous matter. Recent 
observations of the large scale anisotropy of the CMBR by the COBE satellite 
remove this ambiguity by relating temperature fluctuations ST/T directly to 
the amplitude and shape of the underlying power spectrum on scales that are 
still evolving according to linear theory |518| . Furthermore, having normalised 
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the power spectrum in such a manner, we can proceed to determine such 
quantities as 8M/M(r) which measure the clustering of dark matter and can 
therefore help predict the value of the biasing factor b = (SN/N)/(5M/M) for 
a given cosmological model. 

In our discussion of the CMBR anisotropy we shall focus mainly on scales 
greater than a few degrees, since the primordial anisotropy on such scales has 
been unambiguously measured. Observations on smaller scales have been used 
to constrain the shape of the power spectrum on these scales and also to pro- 
vide limits on the gravity wave contribution | [L38|JH3|J5^| , |563| |. The CMBR trav- 
els to us from the surface of last scattering whose comoving distance from us 
practically coincides with the horizon scale da = ZHq 1 — 6000/i -1 Mpc. The 
horizon at last scattering subtends an angle 6 ~ 1.8°fiJ (lOOO/z^) 1 / 2 ~ 1.8° 
for Q ~ 1 and z rec ~ 1000. As a result the CMBR anisotropy on angular 
scales greater than a few degrees probes length scales that were causally un- 
connected at the time of recombination, thereby providing us with the cleanest 
possible probe of the primordial spectrum prior to its distortion by astrophys- 
ical processes. 

The dominant contribution to the CMBR anisotropy on large angular scales 
(> 1°) arises because of the Sachs- Wolfe effect which relates temperature 
fluctuations in the CMBR to the integral over the variation of the metric 



evaluated along the line of sight [|471|| 



5T 1 7 dh a 



T 2 J dr] 



a e?dri, (7S 



where e a is the tangent vector of the photon trajectory, rj — J dt/a is the 
conformal time coordinate (r/ corresponds to the present epoch) and r\ rec to 
recombination; an analysis with three massless neutrino species and z rec = 
1100 gives | |404| | r] Q /r] rec = 49.6. In a flat, matter-dominated Universe, the 



(79) 



gravitational potential does not evolve with time and the above expression 
simplifies to 

ST 1 

T "3c 

In other words, the CMBR anisotropy is directly related to fluctuations in the 
gravitational potential on the surface of last scattering. 

Perturbations in the primordial gravitational potential (usually accompanied 
by gravitational waves) arise naturally in models of the Universe based on the 
inflationary scenario which usually predict a primordial fluctuation spectrum 
Po{k) = Ak n . The scale invariant spectrum n = 1 is predicted by inflationary 
models with exponential expansion, models with power law inflation (PLI) in 
which the scale factor grows as a power law a(t) oc t p , p > 1 predict a lower 
spectral index: n = (p — 3)/(p — 1) < 1. 
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It is convenient to express the cosmic microwave temperature distribution as 

ST.. 



T(9,, 



1 + 



T 



0) 



where T is the blackbody temperature T = 2.736 ± 0.010°fT |ET75| , g76|| . We 
can rewrite ST /T as a multipole expansion on the celestial sphere: 



ST 
~T 



% 0) = E E flftn^T 
Z=2 m=-l 



where 9, <fi are spherical angular cordinates associated with a direction n in the 
sky, Y l m (9, (ft) are spherical harmonics. We have ignored the dipole contribution 
to ST which cannot be distinguished from our local velocityfj with respect to 
the CMBR. The Sachs- Wolfe effect allows us to relate the multipoles ai m to 
the primordial spectrum 



a 



(S) 



aw 



fc2 (t) J§ p{ ^ kx 



(82) 



where x = 2c/ Hq is the present day Horizon size and ji(kx) are spherical 



Bessel functions. For power law spectra P(k) = Ak n the integral in eq. (82 
can be evaluated analytically, with the result 



(S) 



A 
16 



Ho 

c 



f{n,l); f(n,l) 



r(3-n) r[(2l + n- l)/2] 



T[(2l + 5 -n)/2] 



Of considerable importance is the temperature auto-correlation function de- 
fined by 

C(9) = (^(^i)^(n 2 )), (84) 

where fi\-hi — cos 9. Substituting ST /T from eq. (|8l]) and using the addition 
theorem for spherical harmonics we get 



C(9) = -^-Y,( 2l + 1 ) C i p i( cost 

47F l>2 



8 The dipole anisotropy is usually interpreted as being due to the motion of our local 
group of galaxies relative to the CMBR. For v/c « 1, T{9) ~ T (l + (v/c)cas6), 9 
being the angle at which the microwave temperature is being measured relative to the 
motion of the observer. Observations indicate ST /T\di po ie — 1.2 x 10~ 3 from which, after 
correcting for the motion of the Solar system within the Milky Way, we can infer the value 
Vlg — 610 ± 50km/s for the velocity of the mass center of the local group with respect to 
the CMBR. The direction of V LG is towards a = ll h , 8 = -25°, which is some 45° away 
from the nearest large mass concentration - the Virgo cluster, indicating that the Virgo 
cluster contributes only partially to our overall peculiar velocity, and implying thereby that 
more distant (and more massive) mass concentrations must exert a significant influence on 
the motion of the local group. 
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Equation (|83| ) allows us to relate the higher multipole moments directly to the 
quadrupole (for n < 3) 



a 



(S) 



Cf>" 



r 


) + 


r(^) 
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i + 


r(2±2) 



(86) 



The quantity Q| = (h / Ai^C^p F% is sometimes called the quadrupole anisotropy 
of the CMBR; F2 incorporates the finite beam width of the detector, F2 ~ 
0.99 for the COBE-DMR. We use the subscript "S" for scalar perturba- 
tions to distinguish it from the quadrupole anisotropy generated by gravi- 
es) 



tational waves. Relating C^' to A through eq. (|83|) and substituting [|57 
Qs = Qcobe-dmr - 6.46 x 10~ 6 we obtain 



Q 



2 _ Q2 

S ~ ^COBE-DMR 
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5127T' 



n+3 



(87) 



and thus the COBE-normalised amplitude 



A 



8 / c 



n+3 



Q 



COBE-DMR 



(n = 1.1 ±0.32 is indicated by COBE data ||563[| .) This normalisation however 
ignores the presence of gravitational waves which are generically produced in 
most inflationary scenarios and also contribute to the anisotropy in the CMBR 
by the Sachs- Wolfe effect |468| ,|3], |o^4]] . If the contribution of these modes to 
the quadrupole is Q\ then the total quadrupole seen by COBE will be 



Q — Qs Qt' 



(89) 



For particular inflationary models one can calculate the relative contributions 
of scalar and tensor modes [p24} , |152|J482| , |523| , [479| , p48| , p38|J335|| . For power-law 
inflation this gives 
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(90) 



where g(n,2) ~ exp0.6(n — 1), /(n, 2) is defined in equation ([831). Instead of 
normalising to the quadrupole we could, instead, normalise to the variance 
at 10°, this would lead to a slight change in the value of A ||337|| . Having 
normalised the primordial spectrum (at least for one particular inflation- 
ary model) we can now determine statistical indicators of clustering, such 
as 5M/M and £(r), for spatially-flat matter-dominated cosmological models 
such as the standard CDM scenario pB3t[13^ , p8|p8| , p3l , M^ , Pl . Ex- 
tensions of the model such as mixed dark matter (CHDM) or CDM plus a 
cosmological constant were considered in ^^ , |T^g56]gg7| , gTD| , gTg , g6J . 
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3. Local extensions of Linear Theory 



In this section we shall discuss the simplest class of non-linear approximations, 
those which involve extrapolations of the linear properties of the density field 
into the non-linear regime. This kind of model typically does not make any 
attempt to account for the dynamical evolution of clustering, but in some 
situations can give interesting insights into the properties of the non-linear 
mass distribution. 



3.1. The Spherical Model 



One of the simplest and best studied models of nonlinear gravitational in- 
stability is the spherical model. In this model we ignore the tidal effects of 
neighbouring density perturbations upon the evolution of an isolated, ho- 
mogeneous, spherical density perturbation. To justify this we can appeal to 
Birkhoff 's theorem in General Relativity, or Gauss's law in Newtonian Gravity. 
Under these simplifying assumptions an exact analytical treatment is possible 
j538j|70l, p7l , p5l1 . 



In order to understand the dynamics of non-linear spherical collapse, consider 
a spherical density perturbation expanding in the background of a homoge- 
neous and isotropic background Universe. The density of the fluctuation is 
characterised by Q' whereas that of the background Universe by Q (Q' > Q 
will correspond to an overdensity and ft' < Q to an underdensity) . The subse- 
quent fate of the spherical density perturbation will depend crucially upon the 
value of Q'. For Q' > 1 the perturbation will behave just like a part of a closed 
FRW Universe and will therefore expand to a maximum radius, turn around 
at a time t ta , and thereafter collapse to a point at t co u ~ 2t ta . A spherical den- 
sity perturbation with Q' < 1 on the other hand, will mimic an open Universe 
and never recollapse (if Q' < Q then such an underdensity will correspond to 
a void). In an idealised cosmological scenario spherical overdensities might be 
thought of as progenitors of clusters of galaxies ||286|| , whereas underdensities 
would correspond to voids. 

In order to treat the collapse of a spherical overdensity quantitatively let us 
consider a spherical shell of radius R with an initial overdensity 5i and a 
constant mass M = 47nR 3 p(l + <5j)/3, where p is the density of the backround 
Universe. Conservation of energy guarantees 

\R 2 - = E = const. (91) 

At early times the expansion of the shell is virtually indistinguishable from 
that of the rest of the Universe so that Ri = HiRi, R4 being the radius 
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of the shell and Hi is the Hubble parameter at an initial time t = t{. The 
kinetic energy of the shell is therefore Ki = ^HfRf and its potential energy 
is Ui = —GM/Ri = —KiQi(l + Si) where Qi is the density parameter at tf 
3HfQi/2 = AirGpi. As a result we obtain 

E = Ki + Ui = K^n; 1 - (1 + Si)]. (92) 

The requirement for collapse E < leads to the condition 1 + S{ > . 
Substituting fij = Q(z) = Qq(1 + z)/(1 + &oz), Si = S(z) we get 

,w > (93 » 



as a precondition for collapse to occur. Equation (|93|) indicates that in flat 
or closed cosmological models an infinitesimal initial density perturbation is 
sufficient to give rise to collapsed objects. In open models on the other hand 
S(z) must exceed a critical positive value in order for collapse to occur. 

It is relatively straightforward to relate the maximum expansion radius reached 
by an overdensity at turnaround Rt a to its "seed" values Ri, and Sj (equiv- 
alently R(z) and S(z)). Since the mass of a perturbation is conserved, and 
R\ ta = we get 

E = U ta = = --^A(l + St). (94) 

tita tita 

Equating ( |9~2"D and (0) we get 

Rt^ = l + Sj = 1 + S(z) 

Ri ^-(nr 1 -!)"^)-^" 1 j 

The time evolution of a spherical mass shell is identical to that of a spatially 
open or closed FRW Universe. The resulting equations of motion may be 
obtained by integrating eq. ( |9~TD giving | |440| , |579| |: 

R = A(l -cos0) 

t = B(6- sin6) (96) 

for the case E < 0, and 

R = A(cosh9- 1) 

t = B(smh6-6) (97) 

for the case E > 0. In eqs. (^) and (|97D we have A 3 = GMB 2 . The behaviour 
of the background Universe is described by similar equations. 

Setting 9 = n in equation (^) we can express the constants A and B in terms 
of the turnaround radius R ta and the turnaround time t ta : A = R ta /2,B = 
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tta/n. Next using eq. @ and the relationships B 2 = A 3 /GM, M = AnR 3 p/3 
and 8irGp = 3H 2 £l, we can re-express A and B in terms of R t and 5 { : 



2)5,- (a 



-1 



5 



1 + 5* 



2#fiJ /2 [^- (fi^ -1)] 3/2 " 



(9£ 



In a spatially flat Universe, eq. ( |98| ) becomes 



2&' 4 A ' 



(99) 



where we assume 8^ « 1. It is now relatively straightforward to compute 
the overdensity in each mass shell. Since mass is conserved we get, using 
M = AttR 3 p/3 and eq. fl9§), 



P(t) 



3M 



AnA 3 (l - cost 



(100) 



In a spatially flat matter dominated Universe the background density scales 
as 

1 1 

P ^> ~ QnGt 2 ~ 6tcGB 2 {6 -sinO) 2 ' 
So, combining eqs. ( |100| ) and ( |101| ), we get 

\2 



(101) 



5(9) 



P(t) 



1 = ?- 



sin i 



- 1, 



p(t) 2(l-cosfl) 3 

for positive density fluctuations, and 

9 (fl-sinhfl) 2 
m = 2(cosh^-l)3 ^ 1 

for negative density fluctuations. 

From equations ( |102| ) and (|99[) we recover the linear limit for small 9, t: 



(102) 



(103) 



lim 5(9) ~ ~ 
o-o v ; 20 20 \BJ 



3 /6i\ 2 / 3 3, /t\ 2 / 3 



(104) 



indicating that only 3/5th of the initial amplitude is in the growing mode. In 
view of eq. ( |104j ) the criticality condition fl9~3"|) translates into Si > 3(f2^ 1 — 1)/5 
or, equivalently H579H , 

r- 1 \ 3 1 — fin . . 

* (2) > stwtty (105) 
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Table 1 

Linear theory and exact overdensities 5 at various stages of the collapse in the 
spherical top-hat model. 






5l 


6 


-> 


8 L oc e 2 


5oc9 2 


TT 
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0.341 


0.466 


2tt 
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0.568 


1.01 


TT 


1.063 


4.6 


2tt 


1.686 


OC' 



From eq. ( |102| ) we find 5(6 = tt) ~ 4.6 at the radius of maximum expansion 
("turnaround"), and 5(2tt) — > oo at recollapse. The corresponding extrapo- 
lated linear density contrast can be found from equations (|104|) , (p6|) & (|99|): 

^(0)~jjQ 3 (0-sin0)§. (106) 

We thus obtain f^zX 71 ") — 1-063 for the linear density contrast at turnaround, 
and <5l(27t) ~ 1.686 at recollapse. These, and other, values of 5l are compared 
with the exact results 8 in the Table.^ 

Knowing the linear density contrast corresponding to a given perturbation, 
the redshift at which that perturbation "turned around" and "collapsed" can 
be found from 



1 + Z ta 



5, 



1.063 



In reality 5 co u — > oo will never be achieved since exact spherical collapse is at 
best a rather crude approximation, which will break down as the overdensity 
begins to contract, dynamical relaxation and shocks both ensuring that the 
system reaches virial equilibrium at a finite density. The maximum density 
at recollapse can be estimated using the virial theorem and the fact that at 
R = R ta all the energy in the system is potential: 

U(R = R mr ) = 2E = 2U(R = R ta ), (108) 

since U = —GM/R we get R vir = Rt a /2 and p vir = 8p ta [|437|| . The mean 



density of an object at turnaround is p ta / p = 5 ta + 1 — 5.6 so that p ta ~ 
5.6p ta - We therefore get p V i r ~ 8 x 5.6p ta . Since p = (67rGt 2 )~ 1 and setting 

9 It is interesting to note in this context that in a low fl Universe a spherical density contrast 



can grow to fairly large values S ^> 1, without ever recollapsing [44C] 
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t V ir — tcoii — 2i ta we finally get p vir ~ 8 x 5.6 x Ap vir or since p vir = (1 + z) 3 p 

Pvir ~ 179.2(1 + z vir ) 3 p , (109) 

where z vir is the collapse redshift, and po the present matter density. Equa- 
tion (|109| ) permits us to relate the virialised density of a collapsed object to 
the epoch of its formation [p79| , p0|j422l : z vir ~ 0.18(p/p ) 1/3 - 1- (Since the 



present overdensity in clusters is 10 2 < p/po < 10 4 , the above arguments might 
indicate that clusters formed relatively recently at redshift s z < 3, provided 
they formed from spherical density enhancements.) 



Generalisations of these arguments show that the addition of a cosmological 
constant to the Einstein equations does not significantly affect the dynamics 
of a spherical overdensity [P30|J365|| . The final (virial) radius of a spherical 
overdensity in this case turns out to be 



n, 
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(110) 



where r\ = A/8nGp(t = t ta ) is the ratio of the cosmological constant to the 
background density at turnaround. Equation (|110|) indicates that the presence 
of a positive cosmological constant leads to a somewhat smaller final radius 
(and consequently higher density) of a collapsed object. This effect is clearly 
larger for objects that collapse later, when the turnaround density is lower. 

As a final remark in this context, we should mention that studies of the collapse 
of spherically-symmetric perturbations into the fully non-linear regime have 
been discussed in | |218| , ^34| , |192| , P[ |. These investigations showed that spherical 
infall onto an initial overdensity was self-similar and resulted in a density 
profile p oc r" 2 25 around the accreting object | |192| . |4T| . (Solutions with planar 
and cylindrical symmetry are discussed in [|192|| .) 



3.2. Anisotropic Collapse 



Although very convenient from the point of view of an approximate analy- 
sis, the spherical model is nevertheless an idealisation and does not describe 
the collapse of realistic astrophysical objects such as protogalaxies and pro- 
toclusters very accurately. The reason for this is that one does not expect 
a density perturbation to have started out being exactly spherical. It is well 
known that once an overdensity turns around all departures from spherical 
symmetry get amplified, a result sometimes called the Lin-Mestel-Shu insta- 



bility 1 343]. As a result the generic shape of a collapsing obect at late times 



is likely to become spheroidal even though it might have been highly spher- 
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ical to begin withF]. Studies of the motion of homogeneous spheroids have 
shown that collapse usually occus along a preferred direction which coincides 
with the smaller spheroidal axis, collapse to a point in this case occuring 
much faster than it would have for an equivalent spherical density enhance- 
ment ||353| . |573| . |343| . f406| , |399|| . An interesting new feature of anisotropic collapse 
is that the condition Q' > 1 essential for collapse in the spherical model is 
no longer necessary for spheroids, for which collapse along one axis can oc- 
cur even if the ratio's of the kinetic and potential energies marginally exceeds 
unity, implying Q' < 1 for the fluctuation [ [44(J|| . These results have recently 
been both confirmed and extended in [ [26 l|J262| , |572| . |543| . [l7ij where it is shown 
that (a) the presence of shear leads perturbations to collapse faster than they 
would have in the spherical model, and (b) shear can lead to the collapse of 
regions which are initially underdense. Some aspects of anisotropic collapse 
are also incorporated in the Zel'dovich approximation which will be discussed 
in the next chapter. 



3.3. Peaks Theory 



A simple way to seek to extend linear theory into the non-linear regime is 
to imagine that the density field evolves linearly until high-density regions 
reach some critical 'threshold' value 5 ~ 1. Such regions then collapse in some 
manner assumed to be similar to the top-hat collapse of an isolated spheri- 
cal inhomogeneity. This amounts to the assumption that the transition from 
linear to non-linear evolution is 'fast' and that peaks of the linear density 
field are the sites for non-linear structure formation. If this is true then we 
should be able to understand the clustering of non-linear objects by looking 
at the clustering of appropriate sites in the linear density field. An immediate 
problem in implementing this prescription is posed by the need to smooth 
the density field to select appropriate peaks. For power-spectra of the kind 
discussed in §2.4, the density field has structure on all scales. This means that 
5 actually has an infinite number of small scale peaks. When we are looking 
at the properties of what we assume to be protogalaxies, we do not want to 
take account of very high frequency modes which correspond to structure on 
scales much smaller than galaxies. The way to achieve this is to smooth the 
density field with some kind of low-pass filter in the manner described in §2.4. 
Implicit in this procedure is the assumption that high frequency modes of the 
system do not affect the evolution of low frequency modes, which therefore 
continue to evolve linearly; some attempt to justify this is made in |f440|| . As 



we shall see in §4, even with smoothing, this turns out not to be a particularly 
good way of looking at the formation of large-scale structure in most of the 



in 



The reverse is true for underdense regions which become progressively more spherical as 



they expand [267 268 ; see §5. 
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favoured models, because non-linear structure formation does not in general 
occur near peaks of the linear density field. Dynamical effects are important 
even on large scales, and it appears that peaks in the final density field are 
displaced significantly with respect to peaks in the initial conditions ||297|| . Nev- 



ertheless, this general approach has led to useful results and has spawned a 
huge literature on the statistics of primordial density maxima, so it is worth 
reviewing in some detail here. 

To proceed with this approach we need to construct a formalism to describe the 
statistical properties of the linear density field. In the usual models, primordial 
density fluctuations are a Gaussian random field (see §2.4). Recall that for 
such a field, all the statistical information required to specify its properties 
is contained in the autocovariance function £(r) = (8(x)8(x + r))g, where 
£(0) = a 2 , the variance of 5, and the subscript means that the average is 
taken over all spatial configurations, assuming that S(x) is spatially ergodic, 
which is actually true under very weak conditions for Gaussian fields ||. 

The first attempt to investigate the properties of linear density peaks was made 
by Doroshkevich in 1970 ||161|| , who presented his results mainly in the frame- 
work of the baryonic pancake model of galaxy formation Q More recently, this 
formalism was extended to hierarchical models of structure formation, follow- 
ing the lead of Kaiser ||286|| . Kaiser considered peaks to be defined as regions 
where the density field 5(x) exceeds some threshold value 5 C ; such regions are 
often called excursion regions and the set comprising all excursion regions is 
called the excursion set. We shall later discuss the possibility that local max- 
ima might be used rather than excursion regions to indicate the positions of 
proto-objects. With the spherical top-hat model in mind, it perhaps makes 
sense to take 5 C = 1.68, which is the (extrapolated) linear density field at 
recollapse in the spherical collapse modelQ It is straightforward to calculate 
the correlation function of points exceeding 5 C using the Gaussian prescription 
because the probability of finding two regions separated by a distance r both 
above the threshold will be just 



Q2 = J j P 2 (5i,8 2 )d8 1 d8 2 . (Ill) 
Now the iV-variate joint distribution of a set of 8i can be written as a multi- 



11 Doroshkevich calculated a number of statistical quantities of great importance including: 
the number density of peaks of a Gaussian random field in the high peak limit; the shape 
of contours surrounding maxima; the average value of the Euler characteristic defining the 
topology of contour surfaces of a given density. 

12 Numerical simulations indicate a somewhat smaller value of S c : 5 C ~ 1.33 — 1.44 p£ ,178]. 
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variate Gaussian distribution: 



I I AT I I 1/2 

Pn(S 1 , ..5 n ) = J2J1N/2 ex P H^MV) , (112) 

where M is the inverse of the correlation matrix C = (Sidj) and V is a column 
vector made from the Si. The ability to construct, not only the n-dimensional 
joint distribution of values of S, but also joint distributions of derivatives of 
arbitrary order which are all of the form ( |112| ) , is what makes Gaussian random 



fields so useful from an analytical point of view; see 0. The expression ( |112|) is 
considerably simplified by the fact that (Si) = for all i. Using the substitution 
5i = Vi<y and w(r) = £(r) /a 2 yields 

P2\vi,V2) = - y exp<^ — — \. (113) 

2tt _ w 2( r ) { 2[1 -w 2 (r)} J 

The two-point correlation function for points exceeding v c = S c j o is then 



Q 



l+U = ^, (114) 

where Q\ = Pr(z/ > v c ). The exact calculation of the integrals in this equation 
is difficult but various approximate relations have been obtained. For large v c 



and small w we have 286 



U ~ v 2 c w(r), (US) 



while another expression which is valid when w is not necessarily small is [ [459|| 

£ Vc ~ exp[i/ c 2 w(r)] - 1. (116) 



More accurate expansions are given in [276]; higher-order correlations in this 



model can be expressed in the Kirkwood scaling form, see §8.3. Kaiser [ [286] 
initially introduced this model to explain the enhanced correlations of Abell 
clusters compared to those of galaxies. Here the field 5 is initially smoothed 
with a filter of radius several Mpc to pick out structure on the appropriate 
scale. If galaxies trace the mass, and so have £, gg {r) ~ £(r) then the simple 



relation (115) explains qualitatively why cluster correlations might have the 



same slope, but a higher amplitude than the galaxy correlations. This enhance- 
ment is natural because rich clusters are defined as structures within which 
the density of matter exceeds the average density by some fairly well-defined 
factor in very much the way assumed in this calculation. 

This simple argument spawned more detailed analyses of the statistics of Gaus- 
sian random fields |2l MJn^p4j|i^ , |l^j33q , |43li . The interest in most of 



these studies was the idea that galaxies themselves might form only at peaks 
of the linear density field (this time smoothed with a smaller filtering radius). 
If galaxies only form from large upward fluctuations in the linear density field 
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then they too should display enhanced correlations with respect to the mat- 
ter. This seemed to be the kind of bias required to reconcile the standard 
CDM model with observations of galaxy peculiar motions ||148|| . One should 



say, however, that there is no reason a priori to believe that galaxy formation 
should be restricted to peaks of particularly high initial density It is true 
that peaks collapsing later might produce objects with a lower final density 
than peaks collapsing earlier, but these could (and perhaps should) still corre- 
spond to galaxies. Some astrophysical mechanism must be introduced which 
will inhibit galaxy formation in the lower peaks. Many mechanisms have been 
suggested, but none of these are particularly compelling. Indeed, the COBE 
CMB fluctuations have recently given cause to doubt the validity of this kind 
of biasing as a means for reconciling CDM with observations. 

Notwithstanding these reservations about the validity of peaks theory, the 
idea has led to some interesting analyses of the properties of Gaussian ran- 
dom fields and the possible effects of biasing. One particular line of enquiry 
has been to look at local maxima of the density field, rather than just points 
where 5 exceeds some particular value 5 C . The trouble with the original calcu- 
lation by Kaiser |286|1 is that it applies no constraint that the two field values 



5i and 82 are in different proto-objects. The smoothing required to select ob- 
jects of the correct characteristic scale introduces fairly large scale coherence 
into the density field. This means that regions where 5 exceeds some value 
may be quite large compared with the mean spacing of clusters. Since each 
disjoint region above the threshold is implicitly assumed to form one cluster, 
the simple calculation given above will lead to an overcounting of pairs of 
objects separated by distances less than the typical size of excursion regions 



This suggests that a more accurate representation of reality might be ob- 
tained by looking at local maxima of the field. If the criterion for proto-object 
sites is that 5 be a local maximum, it is impossible that two distinct spatial 
positions satisfying this criterion can both belong to the same proto-object. 
Dealing with local maxima complicates the calculations considerably, but some 
progress is possible. For illustrative purposes, we discuss here the argument 
for fluctuation fields defined in one spatial dimension. The difficulties with 
three dimensions will quickly become obvious! 

To handle local maxima, we need to consider not just the value of 5 at a given 
spatial location, but also the first and second derivatives of S. Smoothing is also 
required here, otherwise the field 5 would not be differentiate (in the mean- 
square sense: ||) and local maxima would be consequently undefined. We need 
first need to consider P(S, S',S"), where the primes denote derivatives with 
respect to the spatial coordinate (which we denote x). This joint probability 
is also a multivariate normal distribution ( |1 1 2| ) , but the column vector is 
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written V = (5, 5', 5") and the covariance matrix C has the form 



-a 

'2 



2 \ 
1 



o{ 
\-a\ a\ j 



(117) 



The of are moments of the power-spectrum of the density fluctuations in one 
dimension: 

a) = f P^k^dk, (118) 



where P\{k) is the one-dimensional Fourier transform of £(r) [119, 350| . In 



this notation the variance of 5 is just a = a 2 . To select stationary points 
we need to specify 5' = 0. Since such points form a set of measure zero, 
the appropriate probability would seem to be identically zero. This problem 
can be avoided by replacing the 'volume' dV = d5d5'd5" by d5 dS"\5"\ dx: 
assuming that 5 is spatially ergodic then means that the integral over x can 
be taken to be an integral over the probability distribution and one avoids the 
measure problem. It helps to define, in addition to v (which is as above), the 
parameters 7 = af/002 (which is a measure of the bandwidth of the spectrum 
Pi), and i?* = 01/02, a measure of the characteristic coherence length of the 
field. Defining 9 = —5"/ 02 gives the number-density of stationary points as: 



dN n 



\9\d9dv 



(2vr) 3 / 2 (l - 7 2 )V2R > 



■ exp 




;ii9) 



To further specify local maxima, we require 9 > 0: N p k = dNo is then 
the number of local maxima per unit length. By integrating over v first, and 
then over 9 we find that N p k = 1/2ttR*. To find the number-density of local 
maxima as a function of their (dimensionless) height v we need to integrate 
over 9. The result was found analytically in a classic paper by Cartwright and 
Longuet-Higgins [|100|| who give the probability: 



> v) 



erfc 




-u 2 /2 



1 + erf 



71/ 



/ 2(1- T 2 



(120) 

Notice that in the limit 7^0 the distribution of heights at maxima is just a 
Gaussian. This is the limit of infinitely broad spectrum (i.e. no filtering) and 
it simply tells us that, without the low-pass filter, every point is a local max- 
imum! The limit for a ^-function spectrum 7 — >• 1 is a Rayleigh distribution. 
The fact that a narrow spectrum produces peaks which have v > is not 
surprising: a monochromatic plane wave has all its local maxima at v > 0. 

It is conceptually straightforward to extend these results to three spatial 
dimensions, but it is technically difficult. The reason for the complexity is 
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that, in three dimensions, the local maximum constraint requires the three 
spatial derivatives of S(x) to be zero, and that the matrix of second deriva- 
tives d 2 S I 'dxidxj be negative definite (having three negative eigenvalues). One 
therefore needs a 10-dimensional probability density function for 5, its three 
first derivatives, and the six independent second derivatives. One then has to 
integrate over the region where the second derivative matrix is negative defi- 
nite ||. We shall not go into details here; the differential number-density of 
local maxima has been obtained by p3[ as the function 



dN 



where i?* = \^3a 1 /a 2 , 
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(122) 



The function f(x) is given by |23 



/(*) 




+ erf 




5x A /8 I / I p -5x 2 /2 



(123) 
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In one dimension, the corresponding result has the same form as eq. ( 122 
but with f(x) = x; in two dimensions the result has f(x) = x 2 + e 
The number of local maxima per unit volume turns out to be N p k 
For high peaks 

1/7 



0.016.R; 



N pk (> v) 



v 2 e 



. . . (124) 

As shown in |23| the number density of high peaks in the limit ^> 1 is 
related to the density N x of the Euler characteristic of the surfaces bounding 
the peaks by N pk (> v) = §iV x (> v). Q 



To calculate the two-point correlation function of local maxima is more com- 
plicated still. In three dimensions one needs to construct the 20-dimensional 
distribution of 5 and its first two spatial derivatives at two distinct spatial 
locations, and then integrate imposing the peak constraint at each spatial 
location. No exact analytic results are known for this calculation. In one di- 
mension (which we hope will give a good qualitative guide to the behaviour of 
three-dimensional peaks), the problem boils down to the construction and in- 
tegration over a six-dimensional probability distribution |[L19| . 350|| . Even here, 



13 The relationship between the Euler characteristic x an< l the genus used in §8.8 is genus 
1-X/2. 
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analytical results are difficult to obtain and one must resort to some numeri- 
cal integration. At very large separations, however, the peak-peak correlation 
function has a similar form to ( |116| ), but with v defined in a different way 
|350|| . It is not possible, even in one spatial dimension, to express the peak- 



peak correlation function as a simple function of the underlying correlation 
function of the density fluctuations. This is because the peak-peak correlation 
function depends, not just on £(r) but also upon its derivatives with respect to 
r. For example, notice that the Politzer-Wise formula, equation ( |116|) , gives a 
biased correlation function which is zero whenever the underlying w(r) is zero 
which is not necessarily the case when one considers local maxima [ pL 19|j350 



Higher-order correlations of peaks in one-dimensional fields are discussed in 



124 



The problem with this entire approach is that it assumes that non-linear 
evolution does not move proto-objects away from the positions of local density 
maxima in the linear field. This is unlikely to be the case, except for very large 
separations (where the displacement of matter from its linear position might 
be small compared to the peak separation), or if the primordial spectrum has 
very little large-scale power. A comparison between the 'peaks' of the density 
field evolved using a full numerical simulation, with the peaks in the linear 



density field shows very little correspondence between the two [|128| , [297|| . It 
seems therefore that linear density maxima are probably not, in themselves, 
very good candidates for the positions of galaxy formation; the situation may 
be more favourable for cluster-scale peaks. 

A closely related issue concerning the peaks picture of galaxy formation is the 
influence of shear (both internal - due to asphericity, and external - due to 
the surrounding matter distribution) on the evolution of density peaks. Recent 
studies conducted on galaxy scale peaks in a Gaussian random field show 
that shear can occasionally break up a single density peak into two or more 
halos [|543| , ^72|j66|j67|j68|j69|| . It can also, by boosting the accretion of matter, 



occasionally promote the growth of smaller scale peaks into larger ones. Such 
studies demonstrate that the conventional peaks scenario in which halo's of a 
characteristic scale and peaks in a Gaussian random field (filtered on the same 
scale) are held in one-to-one correspondence, is somewhat simplistic and needs 
to be considerably modified before it is taken seriouslyQ Some attempts have 
been made to include dynamics in this kind of analysis, although it is very 
difficult to do so convincingly [ 23fl . An impressive attempt at this was made by 



Bond & Couchman pD| , who used the Zel'dovich approximation (see later) to 
move linear density peaks away from their initial positions; see also ||362|| . The 
correlation function of the evolved peaks can then be expressed in terms of 
the initial correlations of the linear density peaks, modified by corrections for 

14 Tidal effects are less important for high peaks which can be treated as rare events. 
As demonstrated in p5| the non-linear evolution of a high density peak follows spherical 
collapse if the spectral index n < — 1. 
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the displacement caused by non-linear evolution. Although this calculation 
leads to an exact solution for £, it is not entirely satisfactory because linear 
density peaks are not necesarily in one-to-one correspondence with peaks in 
the evolved field (some may disrupt, others may merge). Because clustering 
evolves dynamically, the final density peaks are formed by the action of particle 
displacements generated by the gravitational potential, which is not a simple 
local function of the density field. A better procedure would therefore be to 



look for peaks in the final density field, as is done in [297.75 



In a bold extension of the peak formalism, Bond and Myers po] , |67 ,5B,B5| have 
developed the so-called "peak-patch" picture for identifying and tracing the 
dynamical evolution of protostructure. In this picture, an entire hierarchy of 
peaks on different scales is identified using different values of the filter radius. 
An ellipsoidal collapse model is then used to follow the evolution within each 
patch, including the influence on its evolution of the external tidal field around 
it. By following peak patches on different scales simultaneously, one can allow 
for patches that overlap in the evolved system. The Zel'dovich approximation 
(§4.2) is then used to move the peak patches thus identified to their final 
Eulerian position. Both the statistical and dynamic contributions to the final 
correlation function are therefore included in this treatment. The use of a 
"catalogue" of peaks as a function of smoothing radius allows one to counteract 
the obvious limitation of the standard peaks theory we have concentrated on 
so far, namely that it assumes all peaks result from the application of a single 
smoothing radius. The procedure is, however, technically complex and it is 
not yet clear how useful will be the results it generates. We shall also make 
some comments on the peak-patch picture at the end of §3.4. 



3-4- The Press-Schechter Approach 



One of the problems involved in trying to identify structure in density fields is 
that the power spectrum of the density field may not possess a physical cutoff 
on small scales. In other words, the filtered variance cr(R) (|61|) diverges as 
the filtering scale is reduced. Such fields always contain structure on a small 
enough scales which is non-linear. One expects structure thus to develop hier- 
archically, with large structures forming by the merger of small-scale lumps. 
This seems far too complicated a process to be dealt with in any variant of 



linear theory, but there is a method, developed by Press & Schechter [§61] 
which attempts to explain some aspects of hierarchical clustering in terms of 
properties of the linear density field. We shall discuss this method in some de- 
tail here, because of its widespread use, and surprising effectiveness! We shall 
henceforward refer to this method as the PS method. 

The PS formalism rests on one critical assumption: that, even if the density 
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field is non-linear on some scale, the amplitude of long-wave modes in the 
density field is close to that predicted by linear theory. This effectively means 
that there must be enough large-scale power in the density field that it is 
not swamped by power generated by non-linear interaction of small-scale 
modes. Consider a massive clump. As described in §3.1, the clump will undergo 
collapse (assuming it is roughly spherical), if its overdensity exceeds some 
critical value 5 C which is of order unity. This suggests that one can identify 
the positions of clumps which will be bound by smoothing the linear density 
field on some scale Rf. On average, a mass M ~ PqRj will be contained within 
a filtering radius, so it is reasonable to assume that points where 5 smoothed 
on this scale exceeds 5 C will be bound objects with a mass M. Nothing in this 
approach determines the form of the filter function: it is conventional to use 
a Gaussian for no other reason than analytical convenience. 

Now, assuming a Gaussian distribution for S, the probability that a randomly- 
selected point exceeds the threshold 5 C when smoothed on a scale Rf is just 



Pr(<5 > 5 e \R f ) =Ps=\ 



1 — erf 



S c 



(125) 



where cr| is the variance of the smoothed field (pID- We now follow the PS 
argument. This probability, ( |125|) , is assumed to be proportional to the prob- 
ability that a given point has ever been contained in a collapsed object on a 
scale > Rf. It is not obvious that this assertion is correct: it amounts to assum- 
ing that the only objects that exist at a given epoch are those which have just 
collapsed. If an object has 5 > 5 C when smoothed on some scale R, it will have 
5 = S c after R is increased, and will therefore be counted as part of a higher 
level in the hierarchy ||430 |. One problem with this assumption is immediately 



obvious, however: matter in underdense regions is not correctly dealt with. 
As a result, half the mass remains unaccounted for. PS 'solved' this problem 
by simply multiplying the probability by a factor of two and attributing the 
additional mass to accretion. This has, however, long been recognised as a 
fiddle. For the moment, we shall leave this problem to one side and continue 
with the PS analysis. 

Let us define the mass function of bound objects, f(M), such that f(M) dM 
is the comoving number density of objects with masses in the range (M, M + 
dM). This function is related to the probability P$ in equation ( |125| ) by 

Mf(M) = po| dP*/dM|, (126) 

where po is the usual mean density of cosmic material. Hence, 

M 2 f(M) = ^J-exp(-5 c 2 /2a 2 ) | dloga f /dlogM|. (127) 

Of V 7T V ' 
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It is useful to define a multiplicity function, q(m), to be the fraction of mass 
contained in objects with mass in unit range of logM: this is just q(M) = 
M 2 f(M)/p . This function has a simple form if the power-spectrum of the 
linear field has a power-law shape (P{k) oc k n ): 

^ = ^^(-t)- (128) 



where v = 5 c /<Jf. 



This calculation of the mass and multiplicity functions of objects forming 
hierchically, using only the properties of the linear density field, contains a 
number of assumptions which are difficult to justify. Even if one accepts the 
basic assumption, that there is a reasonable correspondence between the sites 
where virialised structures form and regions of high density in the linear den- 
sity field, there is still the problem of the factor of 2 'fudge' in the PS analysis. 
One would be inclined to consign the entire theory to the dustbin were it 
not for the remarkable fact that the final result, equation ( |1 28| ) is in remark- 
ably good accord with the results from iV-body simulations | |176| , |178| ]. This 
method has consequently been used in a wide variety of different contexts to 
estimate f(M) for cosmological objects [p^ p| JTT3| , [ill,|l 76| , |1 78jp9| ,^ . The 
unreasonable effectiveness of the PS formalism has spurred others to look at 
the problem from different points of view to see why it works so well. Although 
the issue is not fully resolved as yet, substantial progress has been made using 
some of the techniques outlined in §3.3. Since one might reasonably identify 
proto-objects with peaks of the linear density field, adaptation of the peaks 
formalism to the calculation of the mass function would seem to be a fruit- 
ful line of enquiry [ 8, 23|, 64, 12, 59, 433 1. The most important outcome of these 
analyses has been a clarification of the 'cloud-in-cloud' problem. This is sim- 
ply that the PS analysis does not check that objects which are considered to 
be 'bound' on a scale M are not subsumed into objects on a larger scale as 
the hierarchy evolves. In fact, the Press-Schechter formalism does not really 
deal with localised objects at all: it is simply a scheme for labelling points 
in the linear density field. It turns out that the failure to deal correctly with 
substructure is at the root of the infamous factor two. 



Recall the one-dimensional analysis of local maxima we referred to in the 
previous section. Here one looks to find locations where the trajectory of S(x) 
as a function of x exceeds some value ua, or is a local maximum at some point 
x. For the mass function analysis, it helps to consider the field 5, not as a 
function of spatial position but as a function of smoothing radius R (dropping 
the suffix / for brevity) at some fixed spatial location. Clearly S(R) — > 
as R — > oo. As R decreases, S(R) will develop fluctuations whose amplitude 
continues to increase as R gets smaller. If 5(R) < 5 C for some value Ri, it may 
well exceed S c again for some smaller value of R. (Indeed, if the small-scale 
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power spectrum diverges, this will happen with probability one.) Clearly as 
one proceeds to filter the density field on a range of decreasing scales R%, R n 
the corresponding density contrast at a given location S(x) executes a random 
walk through the points (8i,...,8 n ). The first upcrossing of the level 8 C will 
occur for the largest value of Ri for which Si > 5 C . 

Here the analysis depends on the form of spatial filter one uses. Consider 
the illustrative example of a sharp filter in fc-space, and consider what hap- 
pens in the Fourier domain as R is decreased from infinity. Decreasing R 
corresponds to adding /c-space shells which, for Gaussian fluctuations, are in- 
dependent of each other. As a result the steps in the random walk of S(R) 
become independent for a sharp k-space filter and the solution follows very 
easilyp^l ||433|J59| , ^30| . Consider a point on the trajectory which is just sitting at 
the threshold. The subsequent motion will be symmetric so that it is equally 
likely to be above or below S c at any smaller value of R. The probability that 
the trajectory does not cross the threshold - the survival probability, P s - is 
then given by 



dP s 



dS o-J2n 



S 2 \ ( (5 - 25, 



exp — — — — exp 



2 ( 



2a 2 r 2a 2 



(129) 



One can integrate this expression to give the probability of having crossed the 
threshold at least once to be 

P>! = 1 - P s = 1 - erf {Jj^ , (130) 

which is simply twice the unconditional probability of lying above the thresh- 
old, given by eq. ( |125| ). This therefore supplies exactly the missing factor of 2. 
Unfortunately, however, this does not provide a full solution of the problem 
because different filter functions behave differently to this simple example. 
What happens generically, is that the PS form, equation ( |127| ) is recovered 
for large M but the shape of the function is changed at low M to account for 
the factor two in the total probability. Our problem to figure out why the PS 
analysis works is, therefore, reduced to the problem of knowing why to use 
a particular form for the filter function. At present, we cannot say why the 
sharp &;-space filter best represents the process of hierarchical clustering other 
than that that it is the choice that produces the PS result which is obeyed 
empirically. It may also be that the agreement between PS and numerical 
simulations is essentially fortuitous. One reason for believing that this may be 
the case is that, although the mass function of objects may agree with the PS 
result, the spatial locations of bound structures in simulations are very poorly 
correlated with peaks in the linear density field [ |12ij| , |297| |. Although this does 

15 In the continuum limit the random walk is described by a diffusion equation which can 
be solved analytically p9[. 
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not in itself rule out the PS analysis - the mass function does not specify 
the locations of the final, virialised structures - it certainly suggests that one 
should not take the derivation too literally. It also suggests that attempts to 
calculate the spatial correlations of virialised objects of different masses by 
mixing the PS formalism with the correlations of peaks in the linear density 



field, e.g. |[293|| , are likely to be of little relevance to the real world. On the 
other hand, in §4.5 we show how to estimate the mass function of collapsed 
objects using adhesion theory, which at least attempts to treat dynamical evo- 
lution in a reasonable way, and the results do agree with the PS formalism 
reasonably well, so a method may well be available for this kind of analysis. 
Various alternative approaches have been explored [[326|J302|]303| , |300| , |1 1 1|| , but 
probably the most promising is the peak-patch picture [^6|,0,^,^] which 
appears to have the most detailed treatment of the dynamics of collapse and 
implicitly takes into account the hierarchical nature of the evolution. Q 



3.5. The Lognormal Model 



In §3.3 and §3.4, we dealt with attempts to identify non-linear structures with 
structures in the initial linear density field. As long as matter does not move 
too far away from its primordial location as a result of non-linear evolution, 
these methods might be fairly accurate. In the next sections, we shall discuss 
ways of approximating the full dynamical evolution of structure and thus 
improving upon simple extrapolations of linear theory. Before we move onto 
dynamical approximations, however, it is worth discussing another way of 
extrapolating linear theory which, although simple, seems to describe some 
aspects of the clustering phenomenon reasonably well. 

If the initial density fluctuations are Gaussian (§2.4), then the form of the 
statistical distribution function for 5 keeps the same form - eq. (|63|) - as the 
fluctuations evolve in the linear regime. To apply peaks theory and the Press- 
Schechter approach one implicitly assumes that this distribution function is 
still appropriate for the fluctuations when they are about to collapse. But 
collapse occurs, in the simplest picture at any rate, when S is at least of order 
unity. At the onset of structure formation at a given scale, we must have an 
rms density contrast on that scale of order unity. But the Gaussian distribution 
with a 2 ~ 1 assigns a sizeable probability to regions with 5 < — 1 which are 
clearly unphysical (p < 0). The assumption of Gaussian statistics in the linear 
regime is therefore physically unreasonable. 

These thoughts suggest that one might achieve some measure of improvement 
over linear theory by adopting a distribution function for S which does not 



16 



An interesting formal connection between the Press-Schechter formalism and that of 



Cayley trees has been explored in [105 
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allow 5 < — 1. In the spirit of the other approximations in this section, it 
is reasonable to achieve this by a simple mapping of the initial linear field 
^ lin l— *■ &nl such that the mapping is one-to-one but enforces the physical 
requirement that there should not be regions with negative total density. Of 
course, this ansatz does not include dynamics in a physically reasonable way, 
but one might expect it be some kind of improvement on linear theory. There 
are various ways to impose the mapping [123|, but the simplest and probably 
most useful is with an exponential function: 



P 
P 



5 NL oc exp(5 L77 v). 



(131) 



We have implicitly assumed that the linear field is smoothed on some scale, 
say R, which we have not specified. Obviously, the mapping ( |131| ) does not 
commute with smoothing of S so the actual mass distribution would not have 
the same statistical form when smoothed on different scales. The resulting 
distribution function is called a lognormal distribution and its properties are 
discussed extensively in ||127|| . The constant of proportionality in eq. (|131|) 
must be fixed by requiring that the mean value of 5^l be zero; it will therefore 
depend upon the statistics of 5lin, i-e. upon the linear theory variance a\. 
Of course, the variance of the non-linear 5nl will not be the same as the 
linearly-extrapolated field 5lin- In fact, 



exp 



(°lin) 



- 1, 



(132) 



where a 2 is now the variance of the non-linear field. The probability distribu- 
tion function of 5 (dropping the suffix NL) becomes 



P(5) 



2vr log(l + (7 2 ) 



: exp 



log 2 {1+5)VT+^ 



21og(l + a 2 



(133) 



| |127| , |102| | . The great advantage of this distribution is that one can calculate an- 
alytically any property which can be calculated for the Gaussian distribution, 
using the simple local mapping, eq. (131). The two-point correlation function 
is given by 

£(r) = exp [&(r)] - 1, (134) 

where £l is the linear theory correlation function. Notice the similarity between 
this equation and eq. (|116|) . This similarity extends further: the higher-order 



correlation functions for this model obey the Kirkwood ||307|| scaling relation- 
ship, equation fl299| ), exactly for all orders n. The lognormal field therefore 
has correlation functions which are exactly like those of the biased Gaussian 
density field computed by Politzer & Wise ||459|| . Other statistical properties of 
the P(p) described by equation ( |133| ) are discussed in |7|, |1 23| , |TT8| , |1 1 9j , |T27l . |l 02 



It is worth mentioning just one of these properties here: the lognormal is an 
example of a probability distribution that is not completely specified by its 



45 



moments. In other words, knowledge of all the moments of the distribution of 
all orders n is not sufficient to specify the P{p) completely. This is related to 
the fact that the series expansion of the moment generating function for this 
distribution does not converge ||140| , |127| , |536|| . This apparently bizarre property 



is basically due to the extremely long tail at large positive values in the log- 
normal distribution. It serves as a warning that many of the statistical tools 
one uses in the study of galaxy clustering rely on assumptions that may not 
actually be realised in practice. 



We should point out here that the lognormal distribution, equation (fT33|), is a 
model for the continuous density field and cannot be directly compared with 
galaxy counts, or even particle counts in an iV-body simulation. There is no 
unique way to obtain a discrete distribution from a continuous one; the stan- 
dard procedure in cosmology is to assume that objects are a random (Poisson) 
sample of the underlying density field with a mean rate proportional to the 
local matter density. In other words the probability of a small volume dV con- 
taining a galaxy is just dP oc p dV. One can then construct the probability 
that a volume V contains N galaxies as 

P(N) = J P(p)P(N\p)dp, (135) 
o 

where Vp = J v p dV, so that p is the density smoothed over the cell vol- 
ume. If the distribution is lognormal when smoothed on the scale V, then 
P(p) is given by eq. (|133|) , and P(Np) is just a Poisson distribution. The 
resulting compound distribution is called the Poisson-lognormal distribution. 
Unfortunately, there is no simple closed-form expression for P(N) [|127| , 140 |. 



This distribution function seems to match the observed distribution function 
of galaxies reasonably well ||515|| . An alternative, less transparent, route to 



the 'discretised' form of the pdf for a Poisson-sampled field starts from the 
moment generating function |200| , |127|| . 

Although the lognormal model density field must be thought of as mainly 
phenomenological, and was introduced into the cosmological literature for this 
reason, it is remarkable that numerical A^-body experiments and other ana- 
lytical considerations (§4) show that the distribution function of fluctuations 



in the non-linear regime is well-described by the form ( |133|) . The reason why 
this is the case has been the cause of much debate; various explanations are 



discussed in p7|J317|| . There is some qualitative motivation for the lognormal 
distribution in general terms because of the properties of non-linear processes. 
The central limit theorem states that the distribution of the sum of n inde- 
pendent random variables tends towards a Gaussian distribution for large n 
regardless of the form of the distribution function of the individual random 
variables. If instead we consider the product of n independent random vari- 
ables, then it is clear that the logarithm of this product should have a normal 
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distribution by virtue of the central limit theorem. Lognormal distributions 
occur widely in non-linear physics, just as normal distributions do in linear 
physics. This argument does not give us any motivation for having taken a 
simple local extrapolation of the density field like equation ( p.31| ); a more spe- 



cific argument is needed. Coles & Jones (1991) took the continuity equation 



( lq) as the starting point for a simple argument. If we change variables, using 
a dr = dt and g = pa 3 , this equation can be written in the form 

t£ + (v- V)g + g(V-v) = 0. (136) 

Introducing the total time derivative with respect to conformal time r, we can 
then write 

i£ = -(?•«). (137) 
g dr 

If the velocity field v is Gaussian, then V • v will be Gaussian, too. By inspec- 



tion, the solution of eq. ( 137 ) will therefore be a lognormal distribution for g. 
This does not, however, prove that the real space distribution of g should be 
accurately lognormal in the non-linear regime because (a) the velocity field 
will not evolve exactly linearly in this regime and (b) the total time deriva- 
tive d/ dr follows the fluid elements. At best, this argument can apply only 
in the very weakly non-linear regime. Further confirmation that this is not 
the correct explanation for the effectiveness of the lognormal distribution in 
the strongly non-linear regime is afforded by the comparison between full 
A-body evolution and the lognormal extrapolation via equation ( |131| ) of the 



same initial conditions. This exercise was part of an extended study by |128 



which also looked at other approximations. The result shows that, although 
the distribution function of fluctuations in the final A-body simulations is 
fairly accurately described by a lognormal distribution, the variance of this 
distribution is not that obtained by the simple transformation of the linear 
variance via equation ( |132j ). This means that we need to look more deeply 
into the problem before coming up with any real justification for this model. 
We discuss more detailed dynamical justification for the lognormal approxi- 
mation in §4.2; see also fl3T| , pi7| , p~33"] . Some other non-Gaussian distributions 



are discussed in ||400|J557 . 



4. Non— linear Approximations based on Dynamics 



Although local extensions of linear theory do provide a qualitative first step in 
comparing theory with observations, a deeper insight into gravitational clus- 
tering is provided by the dynamical approximations we consider now. Depend- 
ing upon the degree of nonlinear clustering (characterised by the rms density 
fluctuation a), a particular dynamical approximation may be best suited for 
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describing gravitational clustering at that epoch. In the weakly non-linear 
regime when a < 1 (also known as the quasi-linear regime), Eulerian pertur- 
bation theory (§4.1) provides us with an excellent description of gravitational 
clustering and of the accompanying departures from Gaussianity which result 
if the initial density field was Gaussian. At later epochs (a < 1) the Zel'dovich 
approximation and its perturbative extensions (§4.2, §4.4) provide us with a 
quantitative understanding of gravitational dynamics until the formation of 
pancakes (a ~ 1). Several approximations have been suggested to model grav- 
itational instability during the strongly non-linear regime when o > 1 (§4.3, 
§4.5 - §4.7). Perhaps the most promising of these is the adhesion model (§4.5) 
which provides us with a deep understanding of gravitational clustering and 
describes the growth of cellular structure in the form of pancakes, filaments 
and clumps, as well as the dynamics of voids. A comparison of these different 
approximations is outlined in §4.7. We conclude this section with a brief dis- 
cussion of approximations inspired by a relativistic approach to perturbations. 



4-1. Higher-Order Eulerian Theory 



The linear regime of gravitational instability breaks down when 5 becomes 
comparable to unity marking the commencement of the quasi-linear (or weakly 
non-linear) regime. During this regime the density contrast remains small 
(5 < 1), but the phases of the fourier components <5k become substantially dif- 
ferent from their initial values resulting in the gradual development of a non- 
Gaussian distribution function if the primordial density field was Gaussian 
469| , |534| , [5lTl"| . p20|| . The most obvious way to attempt to extend linear theory 



into the fully non-linear regime is via a higher-order perturbative expansion. 

In this approach [ pqpg|p27| |78l,P2l,|54] PB]|IU^ we write the density contrast 
as a superposition of terms: 

oo 

n=l 

oo 

= J2 D K t ) A(n) (x)- ( 138 ) 



i=l 



For simplicity we shall assume the case of a flat universe which is matter- 
dominated. Perturbative methods for open universe models are discussed in 
||366| , |T8|| . The first term in the above series expansion corresponds to the lin- 
ear growth law: 5^' = D + (t)A^(x). The second-order contribution can be 
written: 

5 (2) = ^(5 (1) ) 2 + W« • V0« + ± E(V,V,0«) 2 , (139) 
where V denotes the derivative with respect to the Eulerian (comoving) co- 
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ordinate x and ft 1 ' is the linear gravitational potential. Higher-order terms 
become rapidly more complicated as n increases. For distributions which are 
initially Gaussian, the weakly non-linear regime witnesses the gradual emer- 
gence of non-Gaussian properties which can be described by the following 
hierarchical relationship for the connected moments of the density field, as 
explained in §8.3: 

{5 n ) c ~S n {5 2 ) n -\ (140) 

The subscript c refers to the connected part of the moment, i.e. that part 
which cannot be expressed in terms of lower-order moments of 5; see §8. 
The coefficent S3 describes the skewness and 64 the kurtosis, as mentioned 
in §8.5. For a Gaussian distribution, S n = for all n > 3. The values of 
n can be calculated exactly by means of an elegant formalism introduced by 
Bernardeau EM. In this approach one constructs vertex weights for the various 



tree diagrams, according to 



_ /A(nh J(5^(x,a)6^i,o) ■ • i (1) fe,a)) c d 3 xd 3 i 1 . . . d 3 x n 
Vn K )c (J(5^(x,a)5^(x',a)) c d^xd^') n ■ 1 J 

It is helpful to obtain the weights from the generating function: 

G^EV. (142) 

The S n can then be straightforwardly obtained from the v n : 

5 3 = 3z/ 2 

5 4 = 4^3 + 12z/ 2 2 

5 5 = 5^4 + 60z/ 3 z/ 2 + 60^ 

5 6 = Qu 5 + 120u 4 u 2 + 90z/ 3 2 + 720z/ 3 z/ 2 2 + 360z/ 2 4 . (143) 

Remarkably, the generating function G can be shown to evolve according the 
same equation as a spherical top-hat solution: 

G. = (? - l) = - !, (M4) 

\p J 2(1 — cos0) d 

where the time-coordinate is reparameterised according to: 



x a 
r = d - 



3 



(9 — sin 1 



2/3 



(145) 



a 5 U 

Expanding G as a power-series in r leads to 

G s = r + 0.810r 2 + 0.601r 3 + 0.426r 4 + 0.293r 5 + . . . , (146) 

which leads to a straightforward determination of the u n and thence the S n . 
The results for S3 are the same as those obtained by a more straightforward 
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analysis by ||440|| : we get S3 = 34/7 = 4.857, in this case. However, this result 
is only valid only if the density field is not smoothed. For smoothed final den- 
sity fields the value of S n depends both on the form of the window function 
and upon the slope of the power spectrum at values of k corresponding to 
the smoothing length. For scale invariant spectra P{k) oc k n with spherical 
top-hat smoothing S3 = 34/7 — (n + 3) ||283| , |36| . For Gaussian smoothing the 
expression for S3 is more complicated |346|| . (Redshift-space distortions to S3 
and other moments are discussed in |[258|| , see also ||104|| .) We shall discuss 
these results later, in §4.7, where we compare the full perturbative solution 
for the hiearchical coefficients with the corresponding results obtained for var- 
ious approximation schemes. (The function Gs always obeys the equation for 
spherical collapse so all we need to do is to solve that problem in a given 
approximation and then calculate the S n as above.) Skewness and kurtosis of 
velocity fields are discussed in |283|j3l||4*0"3"|l , and also in §8.10. 



For small values of a -C 1, the moments S n can be used to reconstruct the prob- 
ability density function (PDF)P^] in the quasi-linear regime using the Edge- 
worth expansion ||304| , |37l , p8"4Tl 



P(S)d5 



1 



-exp(-z/ 2 /2) 



1+ a ^H 3 (u)+ a 2 
o 
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SI 



S 1 
24 



HAv) 



d8, 



Q'2 



(147) 



where v = 5 /a and H n (u) are the Hermite polynomials 



H n {u) = (-l)«exp(z/ 2 /2)^exp(-z/ 2 /2). 



(148) 



In general, the cr-dependence of S n must be taken into account when applying 
equation (liTOT 



The quantity in square brackets provides a correction to the initial Gaus- 
sian distribution. Since the Edgeworth form of the Gram-Charlier series is an 
asymptotic expansion in S„, one needs to know all the moments in order to 
reconstruct the PDF. In practice however, the first two moments S3 and S4, 
suffice to determine the maximum of the PDF to within a reasonable accuracy 
when a < 0.5 ||37|| . As a result, measurements of the shape of the PDF near its 
peak (which is a statistically robust quantity), may allow us to determine the 



17 The PDF can be thought as the volume fraction occupied by a given value of the density. 

18 In the one dimensional case, until shell crossing |37) 83(a) = 6 + 24cr 2 + 0(er 4 ); Si(a) = 
72 + 810cr 2 + 0(<7 4 ); which converge to S3 = 6, S4 — 72 for infinitesimal values of a. In 
three dimensions the functional forms for £3 (er), S^a) etc. are not known; results obtained 
using the Zel'dovich approximation are discussed in p7{ . For nominal values of er, good 
agreement with the PDF obtained from N-body simulations of the CDM model is provided 
by the lognormal model (see §3.5) for which pTl] S 3 (a) = 3+ct 2 , Sa(<t) = 16+15cr 2 +6a 4 +cr 6 . 
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lowest moments S3 and S4 in the weakly non-linear regime when er <C 1. One 
should be aware, however, that there are restrictions on the possible behaviour 
of the moments in order that the Edgeworth expansion converges. 

Another useful way to employ perturbative techniques is by moving into the 
Fourier domain. One can then obtain higher-order corrections to such quanti- 
ties as the power-spectrum. The approach we follow here is that of | 274j| ; see 



also [ |198| . |213| , |359| , |2U3| |. We begin by taking the Fourier transform of 5(x,7]) as 
in §2; rj is the conformal time coordinate drj = dt/a(t): 

<5& V) = 7^53 / d 3 xexp(-ik-x)5(x,r / ). (149) 

In linear theory, the velocity field is curl-free and is therefore completely 
specified by knowledge of 9 = V • v . With this definition, equations (|18D and 
(|T6| ) can be Fourier transformed to yield: 

^- + 6 = - J d% J d%6»(k 1 +k 2 -k)^§(k 1 ,7,)6(k 2 ,7 J ) (150) 

^- + ^0 + ^5 = - f d%[ d%6 D (k 1 +k 2 -k)^ ¥ ^e(k 1 , v )d(k 2 , v ), 

(151) 

where is the Dirac delta function. Recall that the correct definition of the 
power-spectrum is then: 

(6(^,71)6(^,7!)) = P(h, V )6 D (k l + k 2 ). (152) 

We now proceed as we did in real space by expanding 6 and 9 as perturbation 
series: 

00 

6(k,7]) = Y, an Wn(k), (153) 



n=l 

00 



6(k, V ) = Y,a(v)a n - 1 6 n (k). (154) 



n=l 



The general solution to eqs. ( |150| ) and (|151|) can be written: 



6 n (k) = J d 3 qi ...J d 3 q n 5 D (qi + . . . q n - k)F n (qi . . . q n ) ]J Sfa) 

i 

6 n (k)= J d 3 q!...| d 3 q n 5 D (q 1 + ...q n -k)G n (q 1 ...q n )n^i(q i )-(155) 

i 

The functions F n and G n must be obtained from the recursion relations: 



P — * \ Gm(<h ■ ■ ■ On) 

F n {qi ■■■qn) = 2^ 



!i (2n + 3)(n- 1) 



k ■ ki 

(1 + 2n)—j-F n _ m (q m+1 ...q n 
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k 2 k\ ■ k 2 

tv\tv 2 



G ' n—mijlm+l • ■ ■ Qn 



G n {q\ . . .q n 



G m (qi . . . q m ) 
ii(2n + 3)(n-l) 

k 2 k\ ■ k 2 



o k ■ fci , 
kf 



+n- 



k^ k 



-G r . 



2/2 ^n— mvQVn+1 ■ ■ ■ 



(156) 



where k\ = q\ + . . . q m , k 2 = q m +i + • • • q n , k = k\ + k 2 and Fx — G\ — 1. An 



expansion of this type is also given by ||213 



These expressions can be used to construct the higher-order power spectrum 
by calculating the required number of terms in the perturbation expansion for 
5 and calculating the required expectation values. For example, define 



(S m (k)6 n—m (k')) = P m ,n—m 

(k)8 D (k + k') 



(157) 



and the power-spectrum to second order can be shown to be 

P(k, V ) = a 2 (r])P hl (k) + a\ V )[P 2:2 (k) + 2P lj3 (fc)] + 0(a 6 ). (158) 

Notice that the power-spectrum to second order requires calculation of the 
fourth-order terms in 5(k). The required integrals need to be performed nu- 
merically: for specific examples see pM| , ^j2^Jl20i|359i|274^03|| . 



For an application of perturbative methods to non-Gaussian linear density 
fields, see |[206| . Perturbation series for a spherical harmonic expansion are 



discussed by [572]. Some aspects of the perturbation expansion for open Uni- 
verse models are covered by |P66|J103 



These methods are clearly useful (but a bit laborious) and can extend calcu- 
lations of statistical quantities such as the power-spectrum and higher-order 
moments onto large scales where they still show a non-linear signal but can- 
not be reliably probed by computational methods. Nevertheless, there are a 
number of problems which mean that one should look for alternative methods 
to complement such perturbative techniques. First, there is the problem of 
convergence. There is no proof that the perturbation series ( |138j ), ( |153| ) and 
([154D actually converge, nor do we know what the domain of convergence is 
if they do. As soon as the second-order contributions to the power-spectrum 
become important, so do the third- and higher-order contributions so one is 
never sure exactly how much to trust a calculation to a given order. Having 
said that, this method does seem to give results compatible with the A^-body 
simulations for a useful part of the parameter space |[274|| . Moreover, although 
useful for statistical information, the method does not provide an insight into 
the geometrical properties of the evolving distribution. For these reasons it is 
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useful to explore other dynamical approximations which might complement 
the perturbative method. 



4-2. The Zel'dovich Approximation 



In 1970, Yakov B. Zel'dovich proposed an ingenious extrapolation of linear 
gravitational instability into the nonlinear regime [ 574 ], by following pertur- 
bations in the particle trajectories, rather than in the density field. In the 
following, we will use "particle" and "fluid element" interchangeably, because 
we are applying this technique to the case of a continuous medium rather than 
a discrete set of particles. The Zel'dovich approximation has the remarkably 
simple form: 

r = ax(q,t) = a(t) [q + D + {t)u{q)\ , (159) 

where x is the final (i.e. Eulerian) comoving coordinate of a particle initially 
located at a Lagrangian co-ordinate q. In eq. ( |159|) , a(t) is the cosmic scale 
factor, and D + {t) is the growing mode of the linear density contrast, defined in 
§2. We note that an extension of the Zel'dovich approximation which incorpo- 

] . The vector field u(q) is the 



rates the decaying mode has been discussed in 
initial velocity field of the particle (u(q) = v/aD + = dx/dD + ). We confine our 
attention to initially irrotational flows so that u(q) = — V$o(^) ; where $o(<z) is 
the linear velocity potential, which is related to the Newtonian gravitational 
potential <ft by <fi = v4 _1 $o where A = 2/(3H 2 a 3 ); for a spatially flat mat- 
ter dominated Universe A is a constant (see §2.2 and ||315| ,|3il 



The Kelvin 

circulation theorem then ensures that the velocity flow remains irrotational 
at least until orbit crossing. After this, there are regions of multistream flow 
and, although each stream remains curl-free the net velocity field is no longer 
potential. Any initial rotation present in the velocity field is strongly damped 
during expansion in the period before orbit-crossing. Vorticity can however be 
generated in multistream flows within pancakes, as shown in [162|. Attempts 



to incorporate vorticity directly into the Zel'dovich approximation have been 

A generalisation of the Zel'dovich approximation based 

An application 



discussed by p2,|28 
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on the principle of least action is given in ||211|| ; see also 
of the Zel'dovich approximation to cosmic string wakes has been discussed 
in ||450|| . Relativistic generalisations of these approximations are discussed in 
84.8. 



By reparametrising the time coordinate r = D + (t), equation ( |159| ) can be 
made to look just like the inertial motion of a particle moving with constant 
velocity u: 

x = q + ru(q). (160) 

An essential feature of inertial motion given random initial conditions, is the 
intersection of particle trajectories leading to the formation of singularities 
in the density field. A similar effect is predicted also by the Zel'dovich ap- 
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proximation, and can be estimated quite simply by applying the continuity 
condition (mass conservation) 

dM = po d 3 q = p(x, t) d 3 x. (161) 

As a result we get the following expression for the density in terms of the 
Jacobian of the transformation from q to x: 



dV c 

dV, ~ 



dx 
dq 



dqtdqj 

= [1 - D+ftX!®] [I - D + (t)X 2 (q)] [1 - D + (t)X 3 (q)} , (162) 

where dV e = d 3 x and dV\ = d 3 q, are the Eulerian and Lagrangian volume 
elements, respectively, and Ai, A2 and A3 are the eigenvalues of the deformation 
tensor 

d - = dq-dq-' (163) 
The expression for the density distribution is therefore 

P(r,t) = £[l-D + {t)\ 1 ®]- 1 [1 - D+WXaffl- 1 [1 - D + (t)X 3 (q)r l 



a 



p JL(l-D + h + Dlh-Dlh)-\ (164) 



a 



where Ii = Ai + A 2 + A 3 , I 2 = AiA 2 + A 2 A 3 + A1A3 and I 3 = AiA 2 A 3 are the 
invariants of dij. The eigenvalues of the deformation tensor define a unique 
coordinate system in which a parallelipiped preserves its original form upon 
deformation. 



In general, the eigenvalues Aj can be either positive or negative. A very useful 
analytical formula giving the joint probability distribution of an ordered set 
of eigenvalues Ai > A 2 > A3, corresponding to a Gaussian potential field, was 
obtained by Doroshkevich [ |161|| , who showed that 

P(A 1; A 2 , A 3 ) = —^-= exp 1+ 2 2/ (Ai - A 2 )(A 2 - As)(Ai - A 3 ), 

(165) 

where <Tg is the initial dispersion of the overdensity field; a{t) = a(t)ao is the 
corresponding rms linear density at time t in an Qq = 1 Universe. 

A consequence of this result for a gaussian random field $0 is that the prob- 
ability for one eigenvalue to be positive and the remaining two to be negative 
is 42% (the same as that for two positive and one negative eigenvalue). A 
volume element satisfying this criterion will collapse along one direction, re- 
sulting in the formation of two dimensional pancake-type singularities in the 
density field, which arise when 1 — D + (t)Xi = 0; see Fig. (0). 
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Fig. 2. Caustics arising in the Zel'dovich approximation are shown for a two dimen- 
sional realisation of eq. (159) with random (Gaussian) initial conditions. 



We take this opportunity to remark that in this respect the Zel'dovich ap- 
proximation is very similar to the equations describing the propagation of 
light rays in geometric optics. The formation of caustics - alternating bright 
and dark regions in the intensity of light - is a well-known phenomenon in 
the latter context; see [|578| , p09|j316|| . Furthermore, the relationship between 



the singularities occuring in the Zel'dovich approximation and Catastrophe 
Theory has been explored in |JS|,|10|,|Tl],|50| . 

The situation in which two of the eigenvalues are positive and one negative 
leads to the formation of one dimensional filaments, which typically form at 
the intersection of pancakes. Collapse along all three directions (Ai, A 2 , A 3 > 0) 
has a much smaller probability of occuring (8%), and leads to the formation 
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of zero dimensional singularities - clumps, which arise when two nearby fila- 
ments intersect. The reverse situation when all three eigenvalues are negative, 
corresponds to expansion along all three directions and leads to the formation 
of voids. Properties of voids in the framework of the Zel'dovich approximation 
and in the adhesion model are discussed in §5. 

An interesting feature of the pancake model is the percolation of high density 
regions which occurs generically in the Zel'dovich approximation. For Gaus- 
sian initial conditions such a situation is rather unlikely to occur since, in 
order for regions to form a percolating network it is essential that their vol- 
ume (area) exceed 16% (50%) in 3D (2D) ||509|| . On the other hand, in the 
Zel'dovich approximation, we find that pancakes percolate despite the fact 
that they occupy a negligible fraction of the total volume. The reason for this 
is quite simple ||509|| : pancakes, filaments and clumps correspond to regions in 



Lagrangian space (L-space) where Ai > 0. For a Gaussian distribution such 
regions occupy roughly 92% of the total volume in 3D (79% of the area in 
2D). As a result such regions percolate in L-space. Since the Zel'dovich ap- 



proximation (|159|) is a topology-preserving homomorphism, the requirements 
of continuity guarantee that any such mapping q — > x which percolates in 
L-space will do so in Eulerian space (E-space) as well. 

The internal consistency of the Zel'dovich approximation can be assessed by 
means of the following argument [164|. Differentiating equation ( |159| ) twice 



with respect to time allows us to determine the acceleration of the fluid el- 
ement. A dynamical estimate of the density pd yn is then provided by the 
Poisson equation fl2"0]): V r r = — = —AixGpd yn - If the Zel'dovich approxi- 
mation were exact, the values of pd yn determined dynamically from the Poisson 
equation and p calculated using the continuity conditions (|161| - |164j) would co- 
incide. The fractional difference (pd yn ~ p)/p = $ therefore provides us with 
a measure of the accuracy of the Zel'dovich approximation. Performing this 
calculation we find that 

5 = - D\ I 2 + 2D 3 + 1 3 , (166) 
where I2 and ^3 are the invariants of the deformation tensor introduced earlier. 

From the above form for the fractional error 5 it is clear that 5 = if A2 = 
A3 = 0, i.e. the Zel'dovich approximation is exact in one dimension, and in 
higher dimensional situations when the collapse is planar. The generic nature 
of pancake formation has been strikingly demonstrated for a wide variety of 
initial conditions by N-body simulations, e.g. | [38711 . It is worth noting that 
the error term S remains finite even during pancake formation, when D + (t) ~ 
Af 1 , and p — > 00. It is useful to call pd yn the dynamical density and p the 
continuity density. Momentum conservation leads to pd yn , mass conservation 
to p. For laminar flows the Zel'dovich approximation can be suitably inverted 
and expressed in Eulerian space as q(x) = x — D + u[q(x)], the continuity 
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equation then yields [ [415 



5 = \-^\ - 1 = -D + h + D 2 + I 2 - D%I 3 , (167) 

where h,l2, Is are the invariants of the deformation tensor du(x)/dx evaluated 
in Eulerian space. The POTENT procedure was used in [|415|| to determine 
u(x); then eq. ( |167| ) was used to reconstruct the form of the density field in 
the weakly non-linear regime. 

Evaluating the acceleration according to the Zel'dovich approximation ( |159| ) 
and substituting its value in the Euler equation (|16l), we obtain Sd yn = Pd yn / P~ 
1 = —Vv/Hfa (v = aD + u) which, surprisingly, is also just the linear velocity - 
density relation (|2ll). It can be shown [|415|| that in general, Sd yn overestimates 



the real value of the density, whereas 5 tends to underestimate it, so that 
(5 + 5dyn)/2 often provides a better estimate of the actual density contrast 
than either 6 or Sd yn - 

At relatively early times when the density contrast is still linear D + (t) < 1, 
the higher order terms in eq. ( |164| ) - b 2 l2 and $1% - can be ignored relative 
to bl\, with the result that eq. (|164| ) reduces to 

p{x,t)~^{l + D + h) (168) 
a 



or, more usefully, 



^ ~ D + (t)h = D + (t)(Ai + A 2 + A 3 ) = D+(t)V 2 %. (169) 
P 



Since $0 oc 0, we recover the linearised Poisson equation. Comparing eq. ( 169|) 
with eq. ( p.64j ), we find that the predictions of linear theory and the Zel'dovich 
approximation (ZA) as to the formation sites of the first nonlinear objects are 
somewhat different. Whereas the Zel'dovich approximation predicts that the 
first pancakes will collapse near high peaks of the largest eigenvalue Ai, linear 
theory predicts that the first nonlinear objects will form near the maxima of 
Sp/p oc (Ai + A2 + A3). For very asymmetric configurations of the primordial 
density field (Ai >> A2, A3), the results of ZA and linear theory are expected 
to correlate, however important differences can also arise especially in regions 
where some of the eigenvalues are negative. For instance regions in which 
Ai > and A 2 ,A 3 C so that < 0, will remain forever underdense 

according to linear theory, yet would correspond to sites for pancake formation 
according to ZA. 

An interesting feature of one dimensional pancake collapse is that the surface 
density of pancake matter remains finite even as the density within pancakes 
approaches infinity. (For planar collapse the density profile in the vicinity of 
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a pancake located at x\ is p oc \x — Xi\~^.) This leads to finite, well denned 
values for both the gravitational potential and the gravitational force field 
outside pancakes. A discussion of the physics at the sites of caustic formation 
can be found in ||165| , |6T| , pT^ , [579|| . 



Using eq. ( |165| ) in conjunction with eq. (|164|) it is possible to derive an expres- 
sion for P(p, a) - the probability distribution function for the density (PDF) 
in the Zel'dovich approximation |pl6|J317| , |37| : 



9 x 5 3 / 2 
4irp~ 3 N s a 4 



dse 2 CT 2 (1 + e ^)(e 2^+e 



A. 



(170) 



where a is the variance, defined by 



a 2 (R,t) =4irD 2 + (t) J P(k)W 2 (kR)k 2 dk, 



and 



(3 n {s) = Vh s ( \ + cos 



— in — l)7r H — arccos(54/ ps — 1 



(171) 



(172) 



in which p = p/p and N s is the mean number of streams in the flow; N s = 1 
in the single stream regime indicating the absence of shell crossing. 



For small values of the variance a<lwe recover the linear result 



Po(p) 



27TCT 



exp 



(P ~ P ? 
2a 2 



(173) 



The Gaussian density distribution described by equation ( |173j ) is valid only at 
early times when departures from the mean density are small. At late times 
S p/p can become as large as we like in regions where Ai > 0. In regions 
where Ai < (i.e. voids) on the other hand, the density is always bounded 
from below by p > 0, so that Sp/p > — 1. As a result, the initially symmetric 
distribution ( |173|) gradually develops an asymmetry and the probability distri- 
bution function, equation ( |17(J| ) grows progressively more non-Gaussian with 
time; see Figure (|]). It is interesting to note that for large densities p » p 
eq. ( |170| ) has the asymptotic form P(p, t) oc p~ 3 . Since the density distribu- 
tion near a pancake will depend upon the distance 'x' to the pancake we get 
P(p)dp oc dx which gives p oc x _1//2 , which correctly reproduces the density 
profile in the vicinity of pancakes. A discussion of the density PDF in ZA after 
final smoothing can be found in |]37| ; also see [|421|[ . 



For small \p — p\, it can be shown that [ |3171| the probability density function 



resembles the log-normal distribution ||127|| . This is demonstrated in Fig. @ 



where the probability distribution function for the density is shown both for 
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Fig. 3. The probability density function for the lognormal (solid line) and ZA 
(dashed line) for three different smoothing lengths: R s = 18, 12, 6/i~ 1 Mpc., corre- 
sponding to a = 0.11, 0.26, 0.55. Filled regions correspond to the results of iV-body 
simulations of a standard CDM model. Reproduced, with permission, from [317]. 



the lognormal distribution and the Zel'dovich approximation. When we com- 
pare these two approximations with iV-body dynamical simulations, we find 
that for small values of the smoothing length corresponding to o < 0.26 ZA 
models the A^-body results somewhat better than the lognormal distribution, 



whereas for larger values of a the reverse is true ||317 



In contrast to the density distribution, the one-point velocity distribution 
function retains its original Gaussian form to a large extent, even in the non- 
linear regime, until orbit crossing. From the observational viewpoint, the prob- 
ability distribution functions for the density and the velocity could be used to 
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statistically discriminate between initially non-Gaussian distributions (such 
as those predicted by string/texture models of galaxy formation), and dis- 
tributions that evolved from Gaussian initial conditions. Preliminary results 
based on the POTENT reconstruction (§8.10) and the IRAS survey (§8.1) 
have been obtained in ||316| , |317|| ; they indicated that models with Gaussian 
initial conditions are compatible with present observations. 

The Zel'dovich approximation also leads to interesting evolution equations for 
the gravitational and velocity potentials in the non-linear regime. To demon- 
strate this, we assume that the only operative mode of gravitational instability 
is the growing mode. In linear theory this guarantees that the velocity field 
is irrotational. Requiring this to be the case in the non-linear regime as well 
(this is in fact guaranteed by Kelvin's circulation theorem until shell crossing) 



and rewriting the Euler equation and the Bernoulli equations ([16]) and (|17[) 
in terms of the comoving velocity u = (v/aa) = dx/ da and the comoving 
velocity potential u = — V$, $ = (fi v /(a 2 a), we obtain (for a pressureless 
medium) 

du -* 3 r - i 

— + {uV x )u = -—[AV x <f) + u\ (174) 

fa -!W=^-*)' < 175 > 
where A = (3H 2 a 3 /2) 1 . The quantity A does not depend upon time if the 
Universe is flat and matter dominated. Similarly the continuity and Poisson 
equations become 

^ + V x (u V ) = (176) 
oa 

V 2 0=^, (177) 
A a 

where rj = a 3 p is the comoving density. As mentioned earlier, the expression 
for the dynamical density in the Zel'dovich approximation reduces to the lin- 
ear expression: 5d yn = —vv/(Hfa) [ 415| , when combined with the Poisson 



equation ( |177| ) this leads to the following relationship between the velocity 
and gravitational potentials (in a spatially flat matter dominated Universe): 
$ = Acj) which is equivalent to saying that the peculiar fields for the velocity 
and the acceleration are parallel to one another. (In the one dimensional case, 
$ = Aip is exactly true even in the nonlinear regime. As a result the Zel'dovich 
ansatz ceases to be an "approximation" and provides an exact treatment of 
nonlinear gravitational instability in one dimension, until particle trajectories 
cross) . Substituting this result into the Euler and Bernoulli equations fllED and 
(|T7|), we obtain: 

^g + (uV)u = 0; (178) 
^ - KV,$) 2 = 0; (179) 
^-|(V,0) 2 = O. (180) 
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Equation (173) gives du/ da = 0, (d/da = <9/<9a + uV x ), implying that the 
particle motion is inertial. The equation for the velocity potential $ has the 
following interesting analytical solution: 

$(f,a) = $ (g)-fc-^ (181) 

The gravitational potential evolves according to an identical equation. Differ- 
entiating eq. ( 181|) alternately with respect to x and q, we finally get 



x = q - a{t)V q <& 

u = -V x ${x, a) = -V 9 $ (g), (182) 

recovering the Zel'dovich approximation. The fact that the velocity is always 
parallel to the acceleration vector makes particles move along straight lines in 
the Zel'dovich approximation. 

It is interesting that the evolution equation for the potential ( |179| ) and its 
solution ( |181|) allow one to reconstruct the primordial form of the velocity po- 
tential $0 from its present value $. (The same applies also to the gravitational 
potential 0.) Attempts to do this using the POTENT procedure have been 
discussed by ||413| . This involves an integration of the Zel'dovich-Bernoulli 
equation ( |179| ). 



As demonstrated recently by Gramann |[225|| a somewhat different evolution 
equation for the gravitational and velocity potentials would result if we were to 
substitute the expression for the continuity density in the Zel'dovich approx- 
imation (|167|) into the Eulerian continuity equation (|1 76|) . The relationship 



between the gravitational and velocity potentials would then become 

$ = <p - a(t)C g (183) 
(we put A = 1 for simplicity), where the function C g is determined from 

~d 2 d)d 2 6 , d 2 d> , 



2 3 



v 2 ^ = E E 

i=l j=i+l 



dx 2 dx 2 . dxidxj 



;i84) 



the corresponding evolution equation for the gravitational potential would 
then be modified to 

Y a - = °9- (185) 

The Zel'dovich-Continuity equation (|185|) appears to be an improvement over 
the Zel'dovich-Bernoulli equation ( |179| ) when tested against N-body simula- 
tions ||225|| . (One can replace a(t) by D + (t) for greater generality in the above 



equations.) 

We would like to conclude this discussion of the Zel'dovich approximation by 
mentioning that, from the point of view of an observer, the peculiar velocity 
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flow associated with overdense regions can give rise to redshift space caustics. 
Such caustics arise when overdense regions cease expanding along our line 
of sight and begin to contract. Investigating this effect in the linear regime, 



Kaiser ||287|| showed that for a spherical density enhancement, redshift caustics 
occur just inside the turnaround radius. Some insight into this effect can also 



be obtained using the Zel'dovich approximation. Differentiating eq. (|159|) with 
respect to time we get 



V 



dr 
dt 



Hr-aD+V<&. 



(186) 



The second term in the right hand side of eq. ( |186|) indicates the departure 
from the pure Hubble flow: Vjj = Hf. Aligning our coordinate system so that 
the radial direction f is along the z-axis, we obtain the following expression 
for the redshift distance to the particle (or galaxy) 



V ^ 



f a(t) D + (t) 



aq 3 



[l + f)a(t)D + (t) (V$) 3 , (187) 



where / is the usual growth factor, / ~ (see §2.2 eq. (|22|) ). Following the 
deformation of a unit volume element from the Lagrangian coordinate system 
(qi, q 2 , q 3 ), to the modified Eulerian coordinate system (r±, r 2 , z = V/H), and 
using the continuity equation, we get p35 



P(r, t) = ^ det 



/ 



1 — D + d n —D + d 12 —D + di 3 

-D + di2 1 - D + d 2 2 -D + d 23 

(1 + f)D+d 13 -(1 + f)D+d 23 1 - (1 + f)D+d 33 



(188) 

where c?^ denotes the deformation tensor, defined by eq. ( |163| ). In the distinc- 
tive case when the eigenvectors corresponding to the eigenvalues Aj, A 2 , A 3 are 
aligned along the line of sight, equation (|188|) simplifies to give 



D+Ax)-^! - D + X 2 )-\l - D + (l + /)A 3 ) 



(189) 



From eq. ( 189 ) it follows that a redshift singularity along the line of sight 
will arise when D = D z = [(1 + /)A 3 ]- 1 ~ [(1 + ^-^Ag]" 1 , since D z < 
— I/A3, a line of sight singularity will always precede the formation of a 
real density singularity. This is a simple consequence of the fact that collapse 
must be preceded by "turnaround", i.e. the expansion must stop before the 
contraction can begin. The overdensity of matter at the time of formation 
of the redshift singularity can be obtained from eq. (|164j) , in particular for 
n = 1, D z = 1/2A 3 and p z ~ p /a 3 (l - D Z X 3 ) = 2p /a 3 , giving 8 ~ 1, 
which corresponds to the extrapolated linear overdensity of a spherical mass 
fluctuation at turnaround. 
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Although the Zel'dovich approximation is extremely accurate in predicting the 
density field until the formation of pancakes and other caustics, its validity 
breaks down soon after shell crossing. Improvements and extensions of the 
Zel'dovich approximation can be broadly divided into two main categories: 

- those that like ZA are formulated within the Lagrangian formalism; 

- those which have been conceived within the Eulerian framework. 



To the former belong the truncated Zel'dovich approximation and the adhesion 
model |23?| , 238] as well as approximations based on Lagrangian perturbation 
theory ||401|, 91, |92 , [78| . To the latter - the so-called frozen flow and linear po- 



tential approximations |P71|,|34|.|T5[| . 



4-3. The Truncated Zel'dovich Approximation (TZA) 



An important feature of hierarchical clustering is that not all scales are in the 
non-linear regime at any given epoch. The simplest extension of the Zel'dovich 
approximation - the truncated Zel'dovich approximation (TZA) - is based 
upon this fact, and the observation that modes contributing to the breakdown 
of ZA are those that have gone nonlinear at a given epoch. By selectively 
filtering all modes with k > k^L (where k^L is epoch-dependent), we can 
prevent shell crossing and the consequent "smearing out" of caustics that 
occurs in the straightforward ZA ||128| , |321| , |386 |. 



If one chooses the criterion Sp/p ~ 1 to mark the onset of non-linearity then 
the non-linear length scale k~^\ can be determined from the equality 



a{R,t) = {{6p/p) 2 )*=D + (t) 



4vr J P(k)k 2 dk 

V k H 



(190) 



where ku = 27ra(£)/A# is the comoving horizon scale, P(k) is the power 
spectrum. For power law spectra P{k) oc k n 



k- N \ocDf^\t). 



(191) 



Clearly an increase in D + {t) will be accompanied by an increase in k 
eating that, as the Universe evolves, successively longer wavelength modes go 
nonlinear. 



TZA has been compared with ZA and N-body simulations ||128| , p86|J493|| . The 
result of these studies is that TZA performs considerably better than ZA, 
particularly at late times and for primordial spectra with more large scale 
power, when the evolved matter distribution is compared point-by-point with 



63 



a full iV-body simulation with the same initial conditions. The form of filter 
which optimises TZA seems to be a Gaussian window: exp[— k 2 /2ko\ {k^L < 
kc < l-Sfejvx,) applied to the initial power spectrum P{k) ||386|| . The success 
of TZA may at least partially, be attributed to the fact that the evolved 
gravitational potential resembles the smoothed initial potential much more so 
than it does the unsmoothed initial potential. As a result although small scale 
features are wiped out in TZA, large scale features are accurately described 
by it, since particles are made to move in a smoothed potential which closely 
resembles the true evolved potential at any given time ||428| , |385|| . An application 
of TZA to the theory of the clustering of Abell clusters is described in J75l . 



An interesting application of TZA for determining the typical mass displace- 
ment of objects d rms was suggested by Shandarin ||506|| . From ZA ( |159| ) one 
finds 

drms = ((x-q) 2 }^ = D + (t)u rms 

Urms = (u{q) 2 )^. (192) 

Smoothening the original velocity field on non-linear length scales k > fcjvL 
we get 

kNL 

u 2 rms (k NL ) = Ait J P{k)dk. (193) 

ha 

Combining eqns. ( |190| ), ( |192| ) and ( |193j ) gives 

^2 n \ P{k)dk /-iru\ 

d ™ (kNi) = e P(il w (194) 



It is interesting that d rms can sometimes be greater than the non-linear length 
scale fcjvL- This can be clearly seen for power-law spectra with sufficient large 
scale power for which (|194|) reduces to 



i 



n + 3\ 2 / k H \ - j 



n 

2 



drms(k NL )=[ — 7^ k~ N \ -l>n>-3. (195) 

V n+lj \k NL J 

Thus clumps (with a mass given by ( |219| )) can move coherently travelling a 
distance ~ d rms >> kjj\ for n < —1. As pointed out by Shandarin [|506|1 this 



might provide an explanation for the presence of large coherent structures 
on scales which are still evolving according to linear theory in cosmological 
scenario's such as CDM. It is interesting to note in this context that the scale 
at which the phases of 5k can depart significantly from their original values 
is close to d rms P69| , p06|| . Comparatively little is known about the precise 
behaviour of the phases of the Fourier components, however, and this is one 
of the big gaps in our knowledge of the clustering process [ |44(J|j534|j501| , |520| . 
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4-4- Lagrangian Perturbation Theory 



The great success of ZA is at least partly due to its being formulated in 
Lagrangian space rather than in Eulerian space. This allows one to follow 
particle trajectories and the accompanying evolution of overdensities for a 
much longer period of time in ZA - effectively until shell crossing when 5 — > oo 
- than was possible in an Eulerian framework which usually breaks down when 
5 ~ 1. Pursuing this logic further, a general Lagrangian perturbation approach 
in which, as in ZA, the fundamental perturbed quantity is taken to be the 
displacement field rather than the overdensity might be expected to be more 
accurate Mf^P,P,^,^,^,p^ j2^ , |224| , ^03| ,Fll|l0l|1 ■ 



To demonstrate some aspects of this approach let us consider the dynamical 
behaviour of particle trajectories in Lagrangian space. We assume for simplic- 
ity that the Universe is spatially flat and matter dominated. Extensions of 
our argument to a curvature dominated Universe may be found in |78| . As in 
ZA, we assume that the comoving Eulerian coordinate of a particle x and its 
comoving Lagrangian coordinate q are related by a displacement field 



x = q + ${t,q). 



(196) 



Introducing the displacement matrix 



M ik {t, q) = — = b ik + ^— 
oqk oq k 



(197) 



we find that Mi k satisfies the equations 



d_ 

dt 



(198) 



e ikl M km M-} = 0, (199) 

where J = \ detM^| is the Jacobian of the transformation eq. ( |196| ), and 
€i k i is the Levi-Civita tensor. Equation ( |199| ) is the condition for potential 
motion in Eulerian space ||579|| . The resulting density and velocity fields can 
be determined from 



Po 



= 5 



J 



-i 



u 



dx^ 



a*. 



(200) 



In the weakly nonlinear regime, equations (|198|) & ( |199| ) can be solved by 
expanding ^ in a series ^ = +\l>( 2 ) + ..; higher orders in ty( n ' (n > 1) being 
related to lower orders via an iterative procedure [|401| , p^j329|| . In a spatially 
flat matter dominated Universe are separable at every perturbative order 



(n) 



Dl(t)4 n) (0, D + (t)oca(t), 



(201) 
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the lowest order solution being the Zel'dovich approximation 

oqi 



(202) 



The second order terms \I/ = NE^ 1 ) + \E , ^ 2 - ) give rise to what we might call the 
post- Zel'dovich approximation (hereafter PZA) where 



W;? = -^[(iW) 



(1)N2 



(203) 
(204) 

Derivatives are defined the with respect to the Lagrangian coordinate q; sum- 
mation over repeated indices is assumed. As a result, 



(2) 
3,i ' 



X ; 



(205) 



The third-order Lagrangian approximation (or the post-post-Zel'dovich ap- 
proximation, hereafter PPZA) results in ^ = J2n=i where 



(206) 



(207) 



283|,p4|. It should be mentioned that, beginning with the third order approx- 



imation, the particle displacement field is no longer potential in higher order 
Lagrangian approximations. (In one dimension ip^ = ijj^ 2 ' = 0, a consequence 
of the fact that the Zel'dovich approximation is an exact solution to the equa- 
tions of motion (until shell crossing) in this case.) 

Many of the results originally derived for the Zel'dovich approximation have 
recently been extended to higher order Lagrangian theories and especially 
to the post-Zel'dovich approximation (PZA) p3l,p5|, p83jp29| , p2^j403|] . Spheri- 
cal collapse in Lagrangian theories has been investigated in | 403 |; see §4.7. A 



treatment suitable for non-flat Universes is presented in [ IUT | . The moments of 



the density distribution have been calculated in |78| . |103| . |258| ; see §4.7. One can 



also incorporate the PZA into the POTENT reconstruction ansatzby deriving 
an analytical form for the density- velocity relation in PZA [|224j| . Higher-order 



Lagrangian theories have been compared with other approximations, including 



the frozen flow (FF) and linear potential (LP) approximations ||403|| ; the con- 
clusion is that such Lagrangian methods invariably perform better than either 
FF or LP in the weakly non-linear regime (see §4.7). The great strength of 
the ZA when compared with these other methods is that its simplicity allows 
many properties to be calculated analytically. 

As was the case with the Zel'dovich approximation, higher order Lagrangian 
approximations break down soon after shell crossing and the formation of 
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the first pancakes. This problem can be partially remedied if (as in ZA) we 
introduce a cutoff scale in the original power spectrum which filters all scales 
that are going non-linear at a given epoch, the resulting approximations are 
then the truncated versions of PZA, PPZA etc. fl96| . 



4-5. The Adhesion Approximation 



As discussed in §4.2 the Zel'dovich approximation works very well until the 
formation of the first nonlinear structures - pancakes. Since particle motion is 
essentially inertial in ZA, particles passing through one and the same pancake 
do not feel each others gravitational influence. As a result, the stabilising effect 
which high density regions (caustics) have on particle trajectories is absent in 
ZA, and particles are allowed to sail through caustics and are even pushed 
away from them after passing through. This leads to an artificial broadening 
of pancakes, and results in their ultimate disappearence. This limitation is 
shared by extensions of ZA such as the post Zel'dovich approximation etc., 
which break down order by order, soon after shell crossing. 

This tendency is countered in the adhesion model (hereafter AM) - a nonper- 
turbative extension of the Zel'dovich approximation - in which particles are 
made to stick together once they enter into caustics, mimicking the viscous 
effects of gravity on small scales. As a result pancake thickness is stabilised in 
the adhesion model, just as it is in N-body simulations. 

In order to see how this is accomplished we note that the combined effect of 



the terms in the right hand side of the Euler equation ( |174j ) is like a force 
whose presence helps to keep the thickness of pancakes finite. In the ZA the 
right hand side in eq. ( |174| ) is set equal to zero so that the resulting motion 
of a particle is inertial: du/ da = 0. Solving the coupled set of nonlinear 
equations (|174J - |176| ) is not mathematically feasible. However, by replacing the 
right hand terms in equation ( |174| ) by a mock viscosity term introduced to 
mimic the adhesive effects of nonlinear gravity on small scales, we obtain 

— + («V)m = uV 2 u. (208) 



Equation (j208|)is well known from studies in turbulence as the three dimen- 



sional generalisation of Burgers' equation |97|J566| . For potential motion u = 
— V$ which we shall be considering, eq. ( |208| ) can be solved analytically by 
performing the Hopf-Cole substitution a) = —2i/logU(x,a). As a result 
eq. ( |208| ) translates into the familiar diffusion equation 

8U 

?P = uV 2 U. (209) 
oa 
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Solving eq. ( |209|) (omitting some intermediate steps) we obtain [[237|J238 



a) = -1v loj 

u(x, a) 
where the "action" 



—a- f d 3 q exp(-^-S{x,a;q 

/d 3 q(^)exp[-S(x>;q)/2z/] 
/ d 3 qexp[-S(x,a;q)/2z/] 

(x — q) 2 



S(x, a;q) = -® (q) 



2a(t) 



(210) 
(211) 

(212) 



satisfies equation ( |179| ). The trajectory of a particle can now be determined 
from equation ( |211p by solving the integral equation |412| ,[ 



a 

x(q,a) = q + J da'u[x(q, a'), a' 



(213) 



and the resulting density can be determined from the continuity equation 

p(x,a)=p (q)\dx/dq]-\ (214) 

For small (but finite) values of the viscosity parameter v — > 0, the integrals 
in equations ( [21 0j ) and ( |211|) can be evaluated using the method of steepest 
descents. (Note that a small v implies a large Reynolds number R = u^jv, u 
and Iq being respectively, the characteristic amplitude and coherence length of 
the primordial velocity field [|239|1 .) The resulting solutions acquire a simplified 
form p3l , p8|j320| , |32ll : 



and 



$(£,a) = -2z/log ^j a exp (-^-S(x, a; q a )\ 
(^) 3a exp[-^S(x, a; q a )} 



u(x, a) 



(215) 



(216) 



E a ja^l?[-^S(x : a;q a )] 
where q a are the Lagrangian coordinates that minimise the action ( |212| ), i.e. 



S a = S(x,a;q a ) = -$ (<?o 



det (5 ij — 



(x - g a y _ 

2a(t) 
<9 2 $ 1 



mm, 



dqidqj 



2 



(217) 



The points q a that minimise the action satisfy the Zel'dovich equation x(q a , a) = 
q a - aV,f (9«); S(x, a;q)> S(x, a; q a ) for any q ^ q a . 

Equations (|215 ) and (|216| ) have an interesting geometrical interpretation ac- 
cording to which the formation of caustics in E-space can be related to the 
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degree of the mapping from {q a } to {x} - the apex of the paraboloid (Fig. 
@). At early times this mapping is one-to-one, since the curvature of the 
paraboloid P(x, a; q) is greater than that of the primordial potential O ( or 
equivalently $o)- At such times the Zel'dovich approximation is universally 
valid. With passing time, as the curvature of the paraboloid decreases it be- 
gins to touch the potential in more than one point, resulting in a mapping 
from {q a } to {x} which is many-to-one. This indicates the onset of the epoch 
of pancake formation. A pancake forms at x when the paraboloid (with apex 
at x) touches the potential in exactly two points (so that the mapping from 
{q a } to {x} is two-to-one). Filaments form at the intersections of pancakes; for 
them the mapping is three-to-one, and knots at the intersections of filaments, 
the mapping being four-to-one. The density of matter in these structures is 
directly related to the degree of the mapping, so that knots are denser than 
filaments, which in turn are denser than pancakes |476|. 



The masses and velocities of knots can be obtained in this ansatz as follows. 
In two dimensions the mass of a knot at x is given by M(x) ~ pA(<fi, q 2 , 53), 
where A(<fi, g 2 , (ft) is the area of a triangle with vertices at $.,52, and 
p(t) is the mean density of matter in the Universe. In three dimensions, 
M(x) ~ p D(q\,q2,q3,qi), where D(q\, q 2 , ^3, q±) is the volume of a pyramid 
with vertices at q\,q2, <73><?4- Similarly, the velocity of the knot U in two di- 



mensions can be determined by solving the following system ||321|| of algebraic 
equations: 

U-iqi-qV) = $0(92) - $o(<fi) 

U-(qi- <f 3 ) = $o(<r 3 ) - M&) ( 218 ) 

(and similarly in 3D). Equation ( |218| ) describes the velocity U in terms of a 
vector defined in a three dimensional space ($>o,q). The magnitude of U is 
proportional to the area of the triangle constructed by joining the osculating 
points of the potential $0 (<?<*)• Its direction is given by the projection of the 
normal to the plane of this triangle onto the Lagrangian plane defined by the 
vectors (q\, (fa). In one dimension a knot is formed at the apex of a paraboloid 
touching the potential in exactly two points. As a result its mass is given 
exactly by m = (\q\ — q 2 \) x p, and its velocity by u = dx/ da = n(|0 qi — 
q2 |)/(|qi — q2 1 ) , the direction vector n pointing from higher to lower values 
of the gravitational potential. The mass function of knots evaluated using the 
adhesion approximation, shows good agreement with the Press-Schechter mass 
function for a wide range of spectra, in both one and two dimensions [^69|J321 



The formation and evolution of large scale structure is described by the ad- 
hesion model as a two stage process. During the first stage matter falls into 
pancakes and then moves along them towards filaments and then along fila- 
ments to collect finally in knots. At the end of the first stage the formation 
of the skeleton of the large scale structure is complete and virtually all of the 
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matter in the Universe is located in one of three structural units: pancakes, 
filaments or knots. This happens at the epoch when kjf\ ~ = spVaxja^^ 
characterising the average separation between peaks of the gravitational 
potential. (The quantities cr , o"i,<t 2 are the standard deviations in the poten- 
tial and its first and second derivatives respectively; T> is the dimensionality 
of space.) The second stage sees the deformation of the large scale structure 
skeleton due to the dynamical motion of pancakes, filaments and especially 
knots towards each other, caused by their mutual gravitational attraction. This 
stage witnesses the merger of knots and the disappearance of small voids, il- 
lustrated in Fig. (|TTD for the one dimensional case. During this stage the 
paraboloid is extremely broad and can therefore feel very large features in the 
primordial gravitational potential. As a result, the corresponding picture of 
large scale structure often shows the presence of large "secondary pancakes" 
whose size is related to the characteristic correlation scale of the initial poten- 
tial i?^ = VVa /ai ||321|| . Secondary pancakes should be distinguished from 
primary pancakes which form at the onset of gravitational instability. 



In addition to providing a simple geometrical means to determine the masses 
and momenta of clumps, the adhesion model can also be used to determine the 
sizes of voids and the associated void spectrum for a given cosmological model. 
Some properties of voids obtained using the adhesion ansatz are summarised 
in 55.3 (see also IH75I). 



Statistical properties of solutions to Burgers equation have been studied, both 
in and out of the cosmological context ||306| , |236| , ^37| , p38|JT63|J546|| . Numerical 
studies aimed at determining, among other things, the velocity distribution 
and mass function of knots have been performed in one dimension [ |569| | and in 



two dimensions ||321|| . In addition, the evolution of voids ||476|| and filamentary 
structures |[412|J321|| have been discussed at length. 



An interesting result that emerges from these studies is that for power law 



spectra P(k) oc k n the typical mass in knots grows as ||237|J238| , |306| , |509|| : 



M oc a(t) 6/(n+3) 



1 < n < 1 



(219) 



For steeper spectra n > 1 the adhesion model predicts M oc a 3 / 2 P9U5091 . 
It is interesting to compare the predictions of the adhesion model with those 
arising from semi-analytic models of hierarchical clustering such as the Press- 



Schechter approximation ||461|| , discussed in §3.4. PS suggests that equation 
( |219| ) is valid over a somewhat broader range — 3 < n < 4. For very steep 
spectra n > 4 PS predicts the limiting rate M oc a(t) 6 / 7 which is slower than 
the limiting growth rate predicted by the adhesion mo del .["^j 



19 As in §4.2 the preceeding discussion can be made more general by replacing a(t) with 
D + (t), the growing linear density contrast. 
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Fig. 4. The geometrical representation of a paraboloid descending onto the 
(two-dimensional) gravitational potential in order to demarcate sites of structure 
formation. The paraboloid is tangent to the potential in Lagrangian space, its apex 
shows the location of caustics (filaments and clumps) in Eulerian space. Adapted, 
with permission, from |32C[|, 



The adhesion model has been found to work very well in explaining the large 
scale features of the structure of the Universe. Comparison with N-body 
simulations has shown remarkable agreement till very late nonlinear times 
|559| . |560| , [474| , pi7|j476|| . The main drawback of the adhesion model is proba- 



bly the fact that one cannot follow particle trajectories within pancakes and 
filaments, which remain arbitrarily thin in models where the coefficient of vis- 
cosity v is taken to be very small. Note that v has dimensions of (length) 2 , so 
that the thickness of pancakes is proportional to y/u. 
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4-6. The Frozen Flow and Linear Potential Approximations 



In addition to the adhesion approximation, other recent attempts to prevent 
the artificial thickening of pancakes present in ZA, have led to the development 
of the Frozen Flow (henceforth FF) and Linear Potential (LP) approximations. 
Unlike the Lagrangian space based approximations considered earlier, both FF 
and LP are formulated in Eulerian language. For FF see ||371| |; LP is discussed 



in Ipiyiol . In FF a particle moving along a fixed trajectory constantly upgrades 
its velocity to the local value of the linear velocity field. The equations of 
motion in FF can be derived very simply from equation ( |174| ) by neglecting 
the non-linear term (u'V)u and setting u = -AV^o so that 

du 

Ya = °' < 220) 

As a result the comoving velocity field of a particle remains frozen to its ini- 
tial value u(x, t) = u(q = x). Since particles loose memory of previous motion 
they behave as inertialess tracers of the primordial velocity field which is a 
fundamental reversal of the assumptions made in the Zel'dovich approxima- 
tion. Continuity demands that the component of the force field perpendicular 
to the plane of the pancake - V(0o)± vanish at sites of pancake formation. 
As a result particle velocities directed towards the pancake get progressively 
smaller as the site of pancake formation is approached. The velocity compo- 
nent parallel to the pancake however remains unaffected with the result that, 
as a particle approaches a pancake site it slows down and modifies its velocity, 
in such a manner so as to move along the pancake towards lower values of the 
gravitational potential, converging ultimately in a clump at a local minimum 
of 0o- This asymptotic convergence guarantees that particle trajectories will 
never intersect in FF and that the resulting pancake will always remain thin. 
A drawback of FF is that, with the passage of time most particles collect in 
minima of the potential where u = 0, as a result no further evolution of large 
scale structure is possible within the framework of this model. 

The density contrast in FF can be estimated from the continuity equation 
(|18|) or (|176|) which gives 



Substituting V« = — V 2 x 4>q(x) = —5{x)/a = constant, we finally get 

ao 

where 5 + is the unevolved value of S(x)/a in the linear regime, i.e. the value 
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of the linear density field integrated over the trajectory of the particle equals 



the logarithm of the density. Equation (222) presents a generalisation of the 



log-normal distribution ||127|| according to which the linear and nonlinear den- 



sity contrasts are related through T](x,t) = rj Q exp[5(x,t)], S(x,t) being the 
linear density contrast at a fixed coordinate value x. In fact |[403|| the proba- 
bility distribution function in FF in the weakly nonlinear regime reproduces 
the log- normal result (|133|) for a <C 1. Both FF and the log- normal distribu- 



tion give similar results when applied to particles located at local maxima or 
minima of the initial gravitational potential when there is little change in the 
particle distribution. Since at early times voids can be associated with peaks 
in the gravitational potential, FF predicts 5ff — exp[— a(t)} — 1 — > — 1, 

indicating that voids are evacuated fairly rapidly. Comparison with the spher- 
ical top hat model shows that FF leads to a somewhat faster evacuation of 
voids than predicted (see §4.7 and §5.1). 

The linear potential approximation (LP) is based on the observation that 
the gravitational potential usually evolves much more slowly than the density 
field. This is clearly indicated by the linearised Poisson equation V 2 y? oc 5 /a, 
(iS < 1) , which demonstrates that <p = <po = constant, if the Universe is 
spatially flat and matter dominated so that 5 oc a. The Euler equation in this 
case simplifies to 

^ + f- = -fv0 o . (223) 
da 2a 2a 

indicating that particle trajectories follow the gradient of the primordial poten- 
tial - effectively moving along its lines of force. Unlike FF, particle trajectories 
in LP can and do intersect at sites of pancake formation, which correspond 
to the directional minima of the primordial gravitational potential. Particle 
motion is oscillatory in such regions, which prevents the resulting pancake 
from becoming too thick. As in FF, the subsequent motion of particles within 
pancakes follows the primordial potential, with virtually all the particles col- 
lecting in clumps, corresponding to local minima of the initial potential, at 
late times. (Note that in both FF and LP particle trajectories can be curved, 
which sets them apart from the straight line trajectories of the Zel'dovich 
aproximation.) 

An essential limitation of both FF and LP is that, since in both approximations 
particles move according to the dictat of the primordial velocity/gravitational 
potential, neither is equipped to describe the merger of clumps, which is the 
predominant feature of gravitational instability at late times, especially in 
hierarchical models such as CDM. This limitation is not shared by the adhesion 



model as demonstrated in ||321|J476|,^493||; see also §4.7. 



A common feature shared by all the approximations discussed in this chapter 
(i.e. ZA, PWZA, AM, FF and LP), is that they allow a fully parallel treatment, 
making them ideally suited for efficient implementation on highly parallel 
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computers. 



4-7. A Comparison of Non-linear Approximation Methods 



The relative accuracy of nonlinear approximation methods can be established 
by testing them both against each other as well as against exact theoretical 
predictions (where these are available) or alternately, against the results of 
N-body simulations. 

In the weakly nonlinear regime, the predictions made by higher-order Eulerian 
perturbation theory can be used to test the accuracy of different approxima- 
tion methods \ 227[ 405 , 403| , |38|1 . In the strongly nonlinear regime on the other 



hand, comparison with N-body simulations can serve to discriminate between 
the different approximations p8jp3jp9| , p^ j96|j386|,P,|390i . 

In the weakly nonlinear regime (5 < 1) the density field can be described 
by means of a perturbative expansion discussed in §4.1. To compare our ap- 
proximations with the full perturbative expansion, we simply need to solve 
the equation describing spherical top-hat collapse for the different approxima- 
tions and compare the resulting generating functions term-by-term with the 
full perturbative result given by Eqn. ( |144|) and Eqn. (|146|) . One can also then 



proceed to determine the S n in the quasi-linear hierachy using Eqn's. ( 142 ) 
& (11431), which we shall do later. 



Let us now sumarise the results of this procedure ||403|| . For the straightforward 
Zel'dovich approximation we find 



T , -3 



Gs=(l--) - 1; (224) 
for the post-Zel'dovich approximation (PZA, see §4.4) this becomes 

Gs = (l-^-^r) - 1; (225) 



and for PPZA, 



G*=(i-£-£-tI£) 1 (226) 




3 21 1701 

(Everywhere Gs = 6 sp h the density contrast in the spherical top-hat model. 
For the approximations formulated in Eulerian space we get: 



G 5 = I ) - 1 (227) 



sin v 2r 
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Table 2 

Comparison of the hierarchical coefficients S n for the various approximations dis- 
cussed in the text. 





Exact 


ZA 


PZA 


PPZA 


LP 


FF 


s 3 


4.857 


4 


4.857 


4.857 


3.4 


3 


Si 


45.89 


30.22 


44.92 


45.89 


21.22 


16 


s 5 


656.3 


342.2 


624.4 


654.6 


196.4 


125 




12653 


5200 


11666 


12568 


2429 


1296 



for the linear potential approximation and 

G s = exp(r) - 1 (228) 
for the frozen flow approximation. 



Equations ( p24| )- (|228| ) can be used in conjunction with equation ( pL42| ) and 
(|143|) to determine v n and S n by expanding G$ in the vicinity of r = 0. The 
values of Si, ..Sq obtained for the five approximations we consider are listed 
in Table 1. 

One can see that taking higher and higher order terms in the Lagrangian 
theory leads to successively more accurate approximations. As far as the S n 
are concerned, the PPZA is the most accurate of the approximations we have 
looked at, followed in decreasing order of accuracy by PZA, ZA, LP and FF 
||103|| . This conclusion is further borne out by a study of the moments T n = 
(O^c/id 2 )^ 1 of the scalar velocity variable 9 = H~ x Vv [|03| . 



Notice again that the values of S n depend also on smoothing: filtering the final 
density field using a top-hat filter leads to the following expressions for scale 
invariant spectra (P(k) oc k n ): for the "exact" perturbative case ||283| , p6| : S 3 = 



^-(n+3), 5*4 = 45.89-f (n + 3) + |(n + 3) 2 ; for the Zel'dovich approximation 
J37|, S 3 = 4-(n+3), S 4 = 30.22 -f (n + 3) + f (n + 3) 2 ; (also see §4.1, §4.10). 
For the quantity 6 we get, after top-hat smoothing: for the exact perturbative 
case, T 3 = — fi~ 0,6 [26/7— (n + 3)]; for the Zel'dovich aproximation, T3 = 
— fi~°' 6 [2 — (n + 3)]. We see that the Zel'dovich approximation consistently 
underestimates the skewness: for —1 < n < 0.7 T^ ZA ^ and even have 



differing signs [36|. 



A further study of spherical top-hat collapse in the different approximations 
shows that ZA and its extensions are much closer to the exact solution than 
either FF or LP right until the time of recollapse; see Fig. Evidently 
nonlinear approximations formulated in Lagrangian space (such as ZA, PZA 
etc.) are considerably more accurate in the weakly non-linear regime than 



those which are formulated in Eulerian space (LP, FF); see ||227| , [r^ , ^05|j38[1 
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Fig. 5. The evolution of a spherical density contrast in the different approximations 
is plotted against the exact analytical result. The exact solution is labelled by 1; 
the post-post-Zel'dovich approximation by 2; the post-Zel'dovich approximation by 
3; the Zel'dovich approximation by 4; the linear potential approximation by 5; the 
frozen flow approximation by 6; and the linear theory solution by 7. Reproduced, 
with permission, from []403f1 . 



An interesting difference between the Lagrangian and Eulerian approximations 
discussed above is that the former are essentially single step mappings whereas 
the latter involve following the trajectory of a particle or fluid-element through 
several timesteps. This makes the former more amenable to reconstruction 
methods such as POTENT [|T| . 



In the strongly non-linear regime (after shell crossing) the ZA breaks down 
and has to be abandoned in favour of its logical successor, the adhesion model. 



76 



In this regime 5 » 1 and one can no longer rely on a perturbative analy- 
sis to discriminate between the different approximation methods. The logical 
course to take then appears to be to test the different non-linear approxi- 
mations against the results of N-body simulations assuming identical initial 
conditions ||128|J493| , |390| , |96| , |392|| . (For a discussion of iV-body methods, see §7.) 



The problem is that one has to be careful in choosing the correct statistical 
comparison between the iV-body and test density fields. It is possible for an 
approximation scheme to produce a pattern of filaments and knots which has 
a very similar visual appearance to the full dynamical simulation, but has the 
main structures slighly displaced from their corresponding positions in the 
full simulation. On the other hand, one could argue that a true dynamical 
approximation should be able to move matter to the right place and that this 
statistic is therefore a reasonable test. Finding the right statistic to perform a 
comparison which is more specific to pattern than location is a difficult task 
which has not yet been fully solved. 

A recent comparison of non-linear approximations against the results of N- 
body simulations addresses this question in some detail by choosing different 
statistical measures (filamentary statistics, void probability function etc. see 
to discriminate between different approximation methods Some of 



the results of [[493|| are shown in Fig. @. The analysis of ||493|| seems to indicate 
that local Eulerian approximations to gravitational instability such as FF and 
LP begin to gradually breakdown as the non-linear length scale k~^\ becomes 
comparable to i?* = \fT>a\jai - characterising the mean separation between 
peaks (or troughs) of the gravitational potential. The reason for this is clear, 
particles in FF and LP follow the gradients of the initial gravitational potential 
asymptotically approaching the minima of the potential in FF, and oscillating 
about them in LP. Neither FF nor LP takes into account the possible presence 
of long range modes in the gravitational potential, a fact that is used to great 
advantage in both AM and TZ to move particles coherently over large scales. 
As a result the adhesion approximation fares considerably better than FF 
or LP, giving results that are reasonably accurate until the non-linear length 
scale becomes comparable to the coherence length of the potential, kJj L ~ R<j> 
(R,/, = VVa /ai). For models with a considerable spread of power such as 
CDM R<h ^> i?*, and we expect the adhesion model to perform better than 



FF or LP at late times 493 



4-8. Relativistic Approaches 



Throughout this discussion we have concentrated upon Newtonian approaches 
to the growth of inhomogeneities. This is generally justified by the fact that 
one is dealing always with peculiar motions which are much smaller than the 
velocity light and that, in the matter-dominated regime, the relativistic effects 
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Fig. 6. A comparison of four non-linear approximations: the adhesion model (AM), 
the frozen flow approximation (FF), the linear potential approximation (LP), and 
the truncated Zel'dovich approximation (TZ), is made against the results of N-body 
simulations. (The simulations have been performed using 512 2 particles in two di- 
mensions. The initial conditions are scale invariant, with P{k) oc k n and n = 0: 
this corresponds to n = —1 in three dimensions.) The results of the adhesion model 
are shown superimposed on the results of N-body simulations. The comparison is 
made for two distinct epochs: the panels on the left correspond to k7r L ~ i?* and 
the panels on the right to kJj L ~ R^. Reproduced, with permission, from p93| 1. 



of pressure are presumably negligible.^] Nevertheless, there are some reasons 

20 It should be pointed out that it is essential to treat petrurbations relativistically if one 
wishes to follow their evolution on scales larger than the Hubble radius Xh — ct. A gauge- 
invariant treatment of density perturbations was developed in p2| , subsequent discussions 



can be found in [169,402 337 525] 



78 



for considering a relativistic approach. Firstly, Newtonian theory implicitly 
possesses the physically disturbing property that gravitational influence is 
transmitted instantaneously ( "action at a distance" ) and thus violates causal- 
ity requirements. Secondly, the biggest problem in a Newtonian treatment is 
posed by the need to solve the Poisson equation; this can be avoided in a 
relativistic treatment because, if the constraint equations of General Relativ- 
ity are satisfied at early times, the evolution equations guarantee that they 
will be satisfied at all later times. Finally, the relativistic approach is concep- 
tually extremely different from the Newtonian approach and the consequent 
difference in formal construction may lead to approximation schemes that are 
different from those developed from the Newtonian approach. We shall confine 
ourselves to brief remarks on some of the latest developments in this area. 

This final point is especially pertinent at the present time because of recent 
studies of the dynamics of self-gravitating fluid flows. Following a much ear- 
lier suggestion of Ellis [|181|| , Matarrese et al. [|372|| suggested that the Einstein 



equations be approximated by neglecting the magnetic part of the Weyl ten- 
sor. The truncated equations thus obtained are strongly local in that they do 
not require any information about neighbouring spatial points to solve for the 
motion of any point. This approach, which has become known as the "silent 
universe" approach led to an exact solution of the Einstein equations and also 
allows various properties to be calculated numerically. It was subsequently 
shown that the exact solution obtained in ||372|| corresponds to the well-known 
Szekeres solution of Einstein's equations [ |139|| , which is in fact unstable. The 
properties of this instability led the authors of |47j to argue that the outcome 
of a generic collapse would be a cigar or spindle rather than the pancake pre- 
dicted by Newtonian theory. This analysis was criticised on the grounds of the 
validity of the approximation scheme used | |319| |. Further discussions and de- 



velopments of this approach can be found in [[3 73| , |3 74| , |90| , |1 8 3| , f48f . Although the 
final word has not been said, it appears that the approximation of neglecting 
the magnetic part of the Weyl tensor is not valid in general, but it may be a 
useful approach in situations admitting specific symmetries. 

An alternative relativistic approach to structure uses the powerful and elegant 
approach of Hamilton-Jacobi theory ||139| , f483| , f484|| . Using powerful Hamilton- 



Jacobi methods, they refined the analysis of Lifshitz and Khalatnikov 
who considered a Taylor series expansion of Einstein's equations. In fact, they 
consider an 'improved Taylor series' which ensures that the 3-metric of general 
relativity remains positive definite. In this way, they recover the Zel'dovich 
approximation and its higher order generalisations. A comparison between 
the relativistic and Newtonian theory is now being made at higher orders \ 2T \ 

Where these approaches will eventually lead is uncertain at the present time 
21 The authors wish to thank Dave Salopek for a discussion on these issues. 
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but there is certainly active interest in ideas emerging from relativistic con- 
siderations and there is sure to be a substantial investment of effort in this 
area in the near future. 



5. Non— linear Approximations based on Geometry 



Up to now, we have concentrated upon the evolution of denser-than-average 
structures as clustering develops. However, the dominant contribution to the 
perceived visual geometry of galaxy clustering is the nature of the under- 
dense regions, called voids. We shall discuss the statistical characterisation of 
voids in more detail in §8.6. In this section we discuss various ways of trying 
to understand the evolution of underdensities in the gravitational instability 
scenario. 



5. 1 . Void Expansion 



Ever since the discovery of the spectacular Bootes void in 1981, successively 
larger redshift surveys have confirmed that most of the volume of the Universe 
is contained in voids - underdense regions, virtually devoid of galaxies. Voids 
are separated from each other by a network of galaxies, sometimes said to 
constitute a cellular or honeycomb structure [|179| . |308| , |159| . |550|| . Although esti- 
mates of typical void diameters vary, sizes ranging from 10 to 50/i -1 Mpc have 
been quoted as being typical of voids in our neighbourhood p99| . |550| , |143| |. 

The conventional picture of voids has often been to associate them with under- 
dense regions in the primordial density field, which expand faster than their 
surroundings causing matter to be evacuated from within them and swept up 
into a shell at their periphery. The crucial role played by voids in the devel- 
opment of large scale structure was emphasized more than a decade ago by 
Zel'dovich and co-workers ||577| , |580|| . Most of the early work on voids was, 
however, carried out under the simplifying assumption that voids are spheri- 
cally symmetric, or else belong to a regular symmetric honeycomb structure 
ig^^g^gBJ. Icke's "Bubble Theorem" lends support to the 



idea that isolated voids can be treated as being spherical, by showing that 
aspherical underdense regions tend to become spherical in time - the time re- 
versal of the well-known Lin, Mestel & Shu instability ||343| . The bubble theo- 
107| , |1 88|j27 l| , f40| ,[54}1 has since been verified using numerical simulations^- 



rem 



22 It should be pointed out that voids in an ensemble compete with each other for space as 
they expand, and therefore need not remain spherical. Consequently, the bubble theorem, 
which is true for individual isolated voids, does not necessarily hold for a more realistic set 



of initial conditions [ 476 1 
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Although most treatments of void evolution have been within a Newtonian 
framework, attempts to model voids within a fully general-relativistic frame- 
work have also been undertaken |g5g^gg57| , gg5gg5gg5§gig ,gg,g0g| ; the 



study of voids has been extended to include models with a cosmological con- 
stant JJB7J. 



Theoretical modelling of individual voids has largely dealt with voids that 
are either compensated by an overdensity at the void boundary, leading to 
a vanishing overall density contrast, or without any such compensation. In 
the latter case, the overall density contrast is negative. Such uncompensated 
voids are often modelled by a spherical top-hat in which the density is con- 
stant within a spherical region. A common feature of both compensated and 
uncompensated void models is the gradual appearance of narrow overdense 
ridges surrounding voids. Such ridges arise because matter lying close to a 
void center tends to expand faster than that near the edges and thus over- 
takes denser matter lying ahead of it, resulting in shell crossing and the piling 
up of material at the periphery of voids. 

The initial density distribution in compensated and uncompensated voids is 
shown in Figure (0). The void has initial radius Ri and underdensity 6i (at a 
time ti), and is surrounded in the case of a compensated void by an overdense 
shell of radius aRi (a > 1). 

Examining the dynamical behaviour of both compensated and uncompensated 
voids, Bertschinger |4(J found several families of self-similar solutions in which 
the expansion of the void at late times could be described solely in terms of 
dimensionless quantities: a, Si, r/Ri and t/U. These solutions are similar in 
form to the Sedov similarity solutions describing a point explosion in a uniform 
medium ||503| , |502| , |270|| . In the case of a collisionless fluid, such as non-baryonic 



dark matter, the expansion of the void is accompanied by a transfer of energy 
from the interior of the void onto the peripheral shell. This leads to shell- 
crossing and collisionless phase-mixing which leads to the development of a 
self-similar solution which has no memory of the initial conditions. On the 
other hand, if the material is collisional (e.g. baryons), shell-crossing is pre- 
vented by the formation of shock waves. The subsequent dissipation of kinetic 
energy into heat leads to the eventual wiping out of memory of the initial con- 
ditions, and the development of a (different) self-similar solution describing 
the propagation of the shock front. In both collisional and collisionless cases 
the void boundary expands as A oc t 4//5 . In the collisional case the swept-up 
mass of the compensated shell increases as 

Bertschinger also found similarity solutions for uncompensated voids embed- 
ded in (a) collisionless, (b) collisional and (c) a mixture of collisional and 



collisionless media [fiO |. In all dense shell forms around the void whose 

swept-up mass grows as t 2 / 3 and whose expansion rate is A oc t 8 ^ 9 . This is 
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Fig. 7. The initial density profile of a void. The solid line corresponds to the radial 
density distribution of an uncompensated void, and the dashed line to the boundary 
shell of a compensated void. Adapted, with permission, from [40]. 



somewhat faster than the case of a compensated void; see also ||496|J55fl . 

It is often argued that, since the density of matter within a void is always 
bounded by \5\ < 1, one can therefore expect linear theory to be valid for a 
much longer duration within voids than is the case for overdense regions. This 
is not strictly the case, however. According to linear theory the density contrast 
within an underdense region scales as 5 oc — a(t) (in a flat Universe), whereas 
in reality the void underdensity must be constrained to have a magnitude less 
than unity. The result of this is that linear theory greatly overestimates the 
expansion rate of voids; see Figure We have also taken the opportunity 
to show, in Figure the behaviour of the underdensity in different non- 
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Fig. 8. The underdensity in the spherical top-hat model estimated from different ap- 
proximations, plotted against the exact solution. The exact solution is the solid line; 
the Zel'dovich approximation is ZA; the linear or /rozen-potential approximation is 
FP; the frozen-flow approximation is FF; linear theory is LIN. 

linear approximations which we discussed in the previous section. All these 
approximations tend to underestimate the density contrast. 

Studies of individual voids, although they clearly provide important clues 
about the dynamical behaviour of underdense regions, do suffer from the fact 
that they tend to ignore the presence of neighbouring voids. Such studies need 
therefore to be supplemented by other approaches which model voids as part 
of a larger ensemble of voids of different shapes and sizes. We shall discuss 
two such approaches here: the Voronoi model, which is based on geometric 
arguments, and the adhesion model, which we discussed in §4. The statistical 
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Fig. 9. Stereoscopic pair of three Voronoi cells sharing a common boundary. Stars 



indicate the Voronoi nuclei. Reproduced, with permission, from |542| 



characterisation of voids using the void probability function is discussed in 
§8.6. 



5.2. The Voronoi Foam 



In the Voronoi model, the large-scale structure of the Universe (voids, clusters 
and superclusters) , is described by a geometrical skeleton of walls, filaments 



and nodes which together constitutes a Voronoi foam ||552|| . 



The Voronoi foam is constructed as a tesselation, a mathematical partioning of 
space into three-dimensional polyhedral units (polygons in two dimensions). 
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Adjoing polyhedra share common walls, edges and vertices; see Figure (^|). The 
tesselation is contructed around and contains that part of the space which is 
nearer to its nucleus than any other nucleus. The resulting tesselation (which 
is unique) is called the Voronoi tesselation. 

The physical motivation for the Voronoi model is the observation that voids 
will tend to expand more quickly than surrounding material. As a result, mat- 
ter tends to flow out of voids and will generally become trapped in wall-like 
regions between adjacent voids. An additional assumption that is usually made 
in the case of the Voronoi model is that all voids expand at the same rate, so 
that walls separating neighbouring voids will in general form the perpendicular 
bisectors of the line joining the centers of void expansion [[269|| . Intersections 
of walls define regions of higher density (filaments) and intersections of fila- 
ments define zero-dimensional subunits (nodes). Matter in the Voronoi model 
is taken to flow out of voids onto walls, then along walls into filaments and 
finally into the nodes which are the vertices of the Voronoi polyhedra. 

The mathematical prescription used for constructing a Voronoi foam - the 
Voronoi tesselation - has been applied in fields as diverse as metallurgy, ge- 
ology, forestry, molecular physics and astronomy [[394| , [305| , |395| , |526|1 . The first 



practical implementation of the idea in studies of large-scale structure was 
made by van de Weygaert Sz Icke ||542|| , who used the Voronoi model to mimic 



cluster correlations by identifying each Voronoi node in their simulations with 
a cluster of galaxies; see also ||269| , [377|| . 



Their basic prescription to construct the Voronoi skeleton was as follows: a set 
of randomly (Poisson) distributed points were chosen to designate the Voronoi 
nuclei, the centers of void expansion. The number density of nuclei is the only 
free parameter in this model. Knowing the number-density of nuclei, one can 
construct the tesselation using either a kinemetical or geometrical approach. 
The geometrical approach is particularly simple to follow in two dimensions, 
since the basic structural units are then just filaments and nodes. A filament 
always bisects the line joing two neighbouring Voronoi nuclei. The end-points 
of a filament are nodes. Every node marks the termination of exactly three 
filaments. One can construct a Voronoi tesselation starting from any given 
Voronoi nucleus, and following one of the filaments that surround it in either 
a clockwise or an anti-clockwise direction until the filament terminates in a 
node. The next filament bordering the same cell is then chosen and followed 
until it also ends. This procedure is repeated until one ends up back at the 
original node. Repeating this procedure for all the Voronoi nuclei in a given 



sample, leads easily to the Voronoi tesselation ||568|J567 ] . 



The geometrical method described above is relatively easy to apply in two di- 
mensions but acquires considerable complexity in higher-dimensional spaces. 
The kinematic approach, on the other hand, can easily be generalised to an 
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arbitrarily large number of spatial dimensions. The gist of the kinematic ap- 
proach is that test particles are used to trace the Voronoi foam. The following 
procedure is adopted by van de Weygaert & Icke ||542|| : a test particle is chosen 
so that it is initially located close to a (randomly-chosen) nucleus, called the 
parent nucleus N\. The particle is then given a velocity which is proportional 
to its distance from the nucleus and directed radially away from it, so that 

u=£i-fi. (229) 

The particle is assumed to be located at X\ and the nucleus at f\. The motion 
of the particle is modified by means of an interative procedure: x\ — > x\ +vAt, 
until the particle becomes closer to another nucleus N 2 (located at f 2 ) than it 
is to its parent nucleus. At this point the test particle would have crossed a 
cell wall and would now be located just inside the cell defined by the nucleus 
N 2 . When the particle is equidistant from the two nuclei, it is constrained to 
move only within the cell wall, thus staying equidistant from both nuclei, until 
it becomes closer to a third nucleus N 3 than the previous two, indicating that 
it has entered a third cell by crossing a filament which (in three dimensions) 
forms at the intersections of three cells defined by the nuclei N%, N 2 , N 3 . 
The particle is now constrained to move along the filament by modifying its 
motion: 

v -> (vf)f, f = ni3 x n 12 ; 

n-r 3 . n- f 2 

n 13 = — — , n X2 = — ; ^t; (230) 

In - n| |n - r 2 \ 

f is the unit vector along the filament. (The vectors n 13 and n 12 are normal to 
walls defined by the nuclei N\N 2 and N\N 3 respectively.) The particle moves 
along the filament until it is closer to a fourth nucleus N4. The location of 
the particle at this juncture is close to a node, i.e. it is equidistant from four 
nuclei. After this, the particle does not move any further because it has traced 
out the locations of all the basic structural units in the vicinity of the parent 
nucleus N x ||542|| . Repeating for all nuclei leads to a Voronoi tesselation. An 



excellent recent discussion of the construction of three dimensional Voronoi 



tesselation's can be found in ||545 |. 



As mentioned earlier, the only free parameter in the Voronoi foam model is the 
mean number-density of Voronoi nuclei which can, in principle, be determined 
by normalising it to astronomical observations. Making the assumption that 
the nodes in the Voronoi simulation can be identified with Abell clusters of 
richness class 1Z > 1, van de Weygaert & Icke |542j| were able to determine 
the inter-nuclear separation to be ~ 104h~ l Mpcp 3 ! Interestingly, one can then 

23 The number-density of Abell clusters is assumed to be 6 x 10~ 6 h 3 Mpc~ 3 ; when compared 
with the ratio of nodes to nuclei in a Voronoi tesselation (6.733:1), this gives the number- 
density of nuclei. 
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Fig. 10. A two-dimensional Voronoi foam constructed with clustered nuclei. 



find the form of the two-point correlation function of the clusters (i.e. Voronoi 
vertices) to be 

m = (Pi~\ (23i) 



with r = 32/i _1 Mpc and 7 = 1.97 which is quite close to the observational 



results for the cluster-cluster correlation function (§8.2) [|T7|,|T6|. However, as 
we discuss in §8, more recent analyses of cluster correlations have indicated 
that the clustering scale could be much smaller than the original Bahcall- 
Soneira claim: r ~ 14/i -1 Mpc. This is a potentially serious problem for the 
Voronoi model with unclustered nuclei ||531| , |532|| . 

From a mathematical viewpoint the Voronoi tesselation around randomly- 
distributed nuclei is appealing because of its simplicity and because many 
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analytical results are known about the statistical properties of the tessela- 
tion in this case [|396|| . However, if the Voronoi model is to provide a useful 
approximation to physical processes that resulted in the formation of large- 
scale structure, the Voronoi nuclei must be associated with some physically- 
significant spatial locations. In models of structure formation based on ex- 
plosions, Voronoi nuclei can be associated with detonation sites, provided all 
explosions have equal strength and all the explosions occur at the same time. 
The arrival of two shock fronts in the same place will result in the formation 
of a Voronoi wall. Three shocks will lead to filaments and four to nodes. Fur- 
thermore, since there is no a priori prescription exists for the explosion sites 
it is quite reasonable in this case to take them as being randomly distributed 



On the other hand, in more conventional models structure formation proceeds 
by gravitational instability, a process which generically leads to the formation 
of a cellular structure ||388|| ; see §4. In order to apply the Voronoi idea to 



this case, one might associate Voronoi nuclei with centers of the primordial 
underdensities or peaks of the primordial gravitational potential which are 
the progenitors of voids ||322|J476| . However, in most scenarios of structure 



formation the gravitational potential is spatially distributed in the manner of a 
Gaussian random field. Since peaks of a Gaussian random field are clustered, it 
seems a proper application of the Voronoi technology in this case must involve 
a clustered distribution of nuclei. This inevitably introduces more freedom 
into the model through the introduction of parameters needed to describe the 



clustering; see Figure (fLOP- 



Although we have assumed for simplicity that expansion rates in different 
underdensities are identical, this is also not going to be strictly correct. A 
study of voids performed using the adhesion model demonstrates that void 
expansion rates generally depend on the size of the gravitational potential 



fluctuation inside them ||476|| . One might therefore seek to modify the Voronoi 
idea by assigning different weights to Voronoi nuclei, in order to incorporate 
this variation of expansion speeds. The simplest tesselation which includes 
such an effect is called the Johnson-Mehl tesselation | ]278j ]. As we shall see, 
this closely resembles the geometrical structure obtained by evolving with 
the adhesion approximation. A Johnson-Mehl tesselation will also describe 
an explosion model in which the explosions have varying degrees of intensity. 
In the Johnson-Mehl tesselation, walls joining adjacent nuclei are not planes 
and do not bisect the line joining the nuclei. Although it is more difficult 
to construct using simple geometrical arguments, it can be easily generated 
using a variation on the kinematical approach discussed above. It can also 
be constructed using the adhesion approximation, as described in the next 
section. 
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5.3. The Adhesion Approximation Revisited 



We saw in §4.5 how the adhesion approximation provides a simple geometrical 
way to determine the masses and momenta of "knots". It also furnishes an 
elegant prescription for finding the volumes of void regions. This prescription 
involves dividing Lagrange space into stuck and free regions by means of the 
following procedure. The status of a particle originally located at % can be 
determined by descending a paraboloid P(x, a;q) = (x — q) 2 /2a{t) + h onto 
the linear gravitational potential <fto, in such a manner as to be tangential to 
the potential at go; see Figure fllT} ). 

If the paraboloid touches or intersects the potential at a point other than % 
then the particle in questions has already entered a caustic, otherwise it has 
not. Using this procedure, we can divide Lagrangian space at any time into 
stuck and free regions. Stuck regions correspond to particles already in caustics, 
whereas free regions have not yet shell-crossed and are therefore still expanding 
via the Zel'dovich relations, equations ( |160[ ) & ( |164[ ), and correspond to voids 



The location of caustics in Eulerian space can be found very simply by mov- 
ing the border between stuck and free Lagrangian regions by means of the 



Zel'dovich approximation |[476|| . In Figure (|12|) and Figure (|H|) the location of 
caustics (filaments and knots) is shown superimposed upon pictures of stuck 
and free regions. At early times, most particles have not yet fallen into pan- 
cakes, so that the resulting free regions percolate in Lagrangian space, whereas 
stuck regions do not; Figure (|r2|) . At late times the situation is reversed since 
most of the matter is now in caustics, so that stuck regions percolate whereas 
free regions exist as isolated regions; Figure (pT3[). The percolation of stuck and 
free Lagrangian regions is closely related to the issue of the global topology of 
large-scale structure, i.e. whether it is bubble-like or sponge-like; a discussion 



of this issue can be found in §8.8 and in ref. ||478 



This procedure so far has given us the volume of voids in Lagrangian space. 
In order to obtain their volume in Eulerian space (or real space) we apply the 
volume deformation formula obtained from the Zel'dovich approximation: 

dV c = dV x [l - a(t)Ax(q)][l - a(t)A 2 (q)][l - a(t)A 3 (q)], (232) 

where dV e and dVi are the volume elements in Eulerian and Lagrangian 
space respectively. Summing over all elementary volume elements within a 
given Lagrangian space void now gives us the volume of the void in Eulerian 
space: 

N 

V e = £ dV e = E dV i(i)[l " a(t)Ax(q)][l - a(t)A 2 (q)][l - a(t)A 3 (q)], (233) 

i=l 
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Fig. 11. The geometrical prescription of the adhesion approximation. A paraboloid 
is descended onto the gravitational potential in order to demarcate stuck and free 
Lagrangian regions. The peaks of the potential correspond to voids in the Zel'dovich 
approximation. The particle having Lagrangian co-ordinate qo is free in the upper- 
most figure, and has just entered a caustic in the middle figure. The apex of the 
paraboloid, x, describes the location of the caustic (clump with mass m) in Eule- 
rian space. Middle and lower panels illustrate the merger of clumps. Adapted, with 
permission, from [476]. 



where N denotes the number of elementary free volume elements in a given 
Lagrangian-space void. This quasi-analytic method is very simple and can be 
used to obtain the spatial distribution and sizes of individual voids, as well as 



the integrated void spectrum [476 



We would like to mention at this point that voids defined according to the 
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Fig. 12. The distribution of caustics (dots) is plotted superimposed upon stuck (un- 
shaded) and free (shaded) Lagrangian regions for early times when cellular structure 
is just beginning to form. The distribution of caustics in this case was obtained by 
moving the border between stuck and free regions using the Zel'dovich approxima- 



tion. Reproduced, with permission, from [476]. 



adhesion model constitute a subclass^] of voids denned in the Zel'dovich ap- 
proximation. This follows from the fact that in the Zel'dovich approximation, 
a necessary and sufficient condition for the existence of underdense regions is 
that all eigenvalues of the deformation tensor are negative. This is not the case 



Similarly, voids in the Zel'dovich approximation constitute a subclass of voids defined 
according to linear theory. This can be seen from the fact that linear theory voids satisfy 
the inequality Xi < (which follows trivially from the Poisson equation (|2(])). Zel'dovich 
voids, on the other hand, satisfy a more restrictive class of initial conditions Ai, A2, A3 < 0, 
which follows from the requirement cIVe > dVh in equation (p32j) . 
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Fig. 13. The distribution of caustics (dots) is plotted superimposed upon stuck (un- 
shaded) and free (shaded) Lagrangian regions for late times when most of matter is 
already in caustics and cellular structure is fully formed. Reproduced, with permis- 



sion, from |476], where details of the simulations are provided. 



in the adhesion approximation. The reason for this is that the adhesion ap- 
proximation is inherently non-local. One can see in Figure (pT|) that a region 
which is declared a void at an earlier epoch is no longer one at a later epoch 
when it has merged with two neighbouring large voids. Thus not all peaks act 
as void progenitors at all times. This example also underlines an important 
feature of non-linear gravitational instability. This is that the evolution of 
small voids is prone to the tidal influence of surrounding larger voids, which 
can cause the void to shrink and even disappear. The disappearance of small 
voids in the course of cosmological evolution is accompanied by a flow of power 
from small scales to larger scales, as illustrated in Figure ([H|) [ |301| , p21|Jl^J476 
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Fig. 14. The void spectrum (number fraction of voids as a function of diameter) is 
shown for the truncated power-law spectrum P(k) oc k for k < k c ; P(k) = for 
k > k c where k c = 16 x kt and kf is the fundamental mode of the box. The figures 
from top to bottom correspond to three different expansion epochs described by the 
scale factor a. (The scale factor a is also just the linear density contrast on the grid 



scale.) Reproduced, with permission, from [476]. 



For scale invariant spectra P(k) oc k n , — 1 < n < 1, this results in the following 
late time growth law for the mean void diameter: D voi( i oc n+3 ^(i), where 
D + (t) is the growing mode of the linear density contrast [238, 476|| ; recall that 



D + oc a(t) if the Universe is spatially flat and matter-dominated. 

An interesting feature to emerge from studies of voids performed using the 
adhesion approximation is that the sizes of voids tend to be strongly corre- 
lated with the height of the primordial gravitational potential at void centers 
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Fig. 15. The value of the normalised primordial gravitational potential 4>/4> rm s eval- 
uated at the centers of voids (in Lagrangian space) is shown plotted against the void 
diameter for the COBE normalised CDM spectrum. Reproduced, with permission, 



from [476]. 



(defined in Lagrangian space), with larger voids originating from those re- 
gions where the initial potential is higher; see Figure (|To]). However, voids will 
not in general have a one-to-one association with high peaks in the poten- 
tial simply because complicated features in the primordial potential - ridges, 
saddle-points, etc. - can sometimes lead to the formation of voids in those 



regions where the potential does not have a well defined maximum ||476 



At late times, voids in Lagrangian space occupy a negligible fraction of the 
total volume, and it is straightforward to implement the adhesion approxi- 
mation by considering only the maximum value of the gravitational potential 
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Fig. 16. The adhesion approximation at late times - the "needle approximation" . 



in separate localised regions (Lagrangian-space voids) as the progenitors of 
structure. We call this the "needle approximation" since the gravitational po- 
tential is effectively replaced by a set of discrete vertical needles on which 
the paraboloid is moved in order to map out the skeleton of the large-scale 



structure; see Figure (|I6|) 



The location of the needle is similar in many respects to the location of the 
Voronoi nuclei, and indeed the needle approximation gives a spatial geometry 
which is in many respects equivalent to a Johnson-Mehl tesselation. 

An important characteristic of the adhesion model, which is not shared by 
most approximation schemes, is that it constructs a skeleton that evolves 
dynamically with time. This can be expressed in geometrical language by 
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saying that the paraboloid mapping the large-scale distribution surveys larger 
and larger regions of the gravitational potential as the Universe evolves. It can 
therefore describe such dynamical processes as the merger of knots and the 
disappearance of small voids, which is not possible in an approach which is 
manifestly local in construction such as the Zel'dovich approximation. It is 
also clear that if all peaks of the gravitational potential were taken to have 
equal heights, the skeleton of large-scale structure would freeze at late times 
in the adhesion framework. The adhesion approximation would in this case be 
entirely equivalent to a Voronoi tesselation, performed using the local maxima 
of the gravitational potential as sites for the Voronoi nuclei. 

Studies of voids have also shown that they can be populated by substruc- 
tures, such as mini-Zel'dovich pancakes and filaments, which run through 
voids bounded by larger pancakes ||476| , |544|| . The likelihood of a void being 



populated by substructure increases with its size at a given time, and decreases 
with time because voids get emptier as they evolve. Interestingly, evidence for 
substructure within existing large voids has been reported for both the Bootes 
void ||561|| and for voids in the Cf A survey ||425|| . 



6. Statistical Solutions and Scaling Arguments 



In this section we discuss methods for obtaining statistical information about 
non-linear clustering in an analytical approach. In these approaches, one aban- 
dons the idea of covering the evolution of individual fluid elements and concen- 
trates instead upon global statistical properties like the correlation functions. 
Even with this simplification, analytical progress is difficult and one normally 
has to resort to further approximations. This normally involves the assump- 
tion of some form of scale-invariance (for which there is some evidence from 
the observed properties of the galaxy distribution; Sec 8). 



6.1. Scaling model for the Correlation Function 



We begin with a simple model for the evolution of the two-point covariance 
function of the matter fluctuations which was first suggested and studied in 
|247|| for the case where Q — 1. The simplest way of understanding this model is 
by reference to the spherical collapse model we discussed in §3.1. Recall that, in 
the case where Q — 1, a spherical inhomogeneity virialises at an actual density 
contrast of order around 100 at a time when the density contrast predicted by 
linear theory would be of order unity. Now the two-point covariance function of 
the matter fluctuations is, roughly speaking, the 'average' profile of a spherical 
clump. Assuming that collapse occurs when the integrated overdensity reaches 
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some critical value, it seems that the relevant quantity to deal with should be 
the volume-averaged correlation function, defined by 



3 ? 



(234) 



A density enhancement of a factor 1+5 may be thought of as arising through 
a spherical collapse by a factor (1 +5) 1 / 3 in radius so one could further expect 
a given integrated non-linear correlation on a scale rjvi to have arisen from 
linear fluctuations on a scale in the initial conditions: 



1 + £,nl{?nl) 



1/3 



TNL- 



(235) 



We must now conjecture that the non-linear integrated correlation function 
is a universal function of the linear one, after taking account the change of 
scale: 

tNL(r NL ) = f[( L (r L )]. (236) 

The success of this approach depends upon finding the appropriate function 
/ which should be independent of the form of the initial power spectrum (or, 
equivalently, correlation function). We can see quite easily that, for small x, 
the function f(x) should be f(x) ~ x to reproduce linear theory. The spherical 
collapse model further suggests that f(x — 1) ~ 100. If we assume that at 
large x the fluctuations enter a period of stable clustering |[44C|| , we should 
get £tvl oc a 3 . But the linear covariance function scales as a 2 . Asymptotically, 
therefore, we have f(x) oc x 3 ! 2 for large x. In between these two extremes, one 
can calibrate the observed behaviour of £ against numerical simulations and 
find a fitting formula for the intermediate range of scale |[247|| . Remarkably, it 



does appear that a scaling formula of the form ( |236| ) does indeed exist that 
satisfies the two asymptotic limits: 



/(*) 



x + 0.358x 3 + 0.0236x 6 
1 + 0.0134s 3 + 0.00202x 9 / 2 ' 



(237) 



A generalisation of this formula to non-flat universes and models with a cos- 
mo logical constant has been given in | 431 |, who also showed how to apply 
similar techniques to the evolution of the power spectrum rather than the cor- 
relation function; it has also been discussed from a theoretical point of view 
in ETUI. 



6.2. Statistical Mechanics: the BBGKY hierarchy 



To obtain a more detailed statistical description of a non-linear self-gravitating 
system requires a much more sophisticated approach than that above, and this 
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leads us into a treatment based on statistical mechanics. The pioneering inves- 



tigation by Davis and Peebles ||1 50|| into the hierarchical closure of the BBGKY 
equations is of such historical importance in the understanding of non-linear 
gravitational clustering that we will discuss it here in some detail. We begin 
with some basic statistical mechanics. 

Consider a (classical) system of N particles, interacting with each other via 
two-body forces described by a potential <f>(xi, Xj), where Xi is the position of 
the i-th particle (i = 1, . . .N). We take the particles to be identical, so the 
Hamiltonian for the system is symmetric under interchange of particle posi- 
tions; the potential is taken to be a function only of the particle separations: 

<pij = (f)(Xi, Xj) = (j){\Xi - Xj\). 

The phase-space for this system is 6iV-dimensional, denoted by F = 71 © 
72 • • • © In where 7; is a one-particle phase space. The system is described by 
a point in the phase-space T; let the probability density function (pdf ) of this 
point be P N . The evolution of P/v is governed by Liouville's equation: 

9Pn . dP N ^ _ dP N 

^ + ?^^r + ^-^r = - (238) 

The particle accelerations a$ are given by 

The fact that this equation involves a 6iV-dimensional function Pn encour- 
ages one to seek a lower-dimensional representation. The way to do this is to 
marginalise the distribution Pjy over part of the full phase space. For example, 
to construct a one-point distribution Pi (71), one simply marginalises over all 
but one of the phase space elements: 

P 1 ( 7l ) = J P N d l2 d lz . . . d lN . (240) 

Marginalising both sides of eq. (|238|) leads to 



(The factor of (N — 1) comes from the sum over i 7^ j terms; [ |486|| ). Equation 
(|241|) makes the problem clear. We cannot solve for Pi without knowledge of 
P 2 (7i,72) and we will not be able to get P 2 without knowledge of P 3 and so 
on. We will therefore be led to a hierarchy of iV equations and, apparently, 
no better off than with the original Liouville equation ( |238| ). The physical 
reason for this is that the evolution of a phase-space point depends on the 
distribution of all the other points, which in turn depends upon the position of 
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the original point. The fact that Pi depends only on P2 explicitly is by virtue 
of the two-body nature of the force between the particles. 



Having established the need for an hierarchy, we now derive it in the stan- 
dard form called the BBGKY equations, after its inventors Born, Bogoliubov, 
Green, Kirkwood and Yvon |T9|J486|| . It is convenient to change the definition 
of our functions P n by including a factor in the marginalising rule: 



^n(7l, 



-7nJ 



(N-n)\ 



Pn(ii, 72 • • • 7Af)^7n+i • • • dj N . 



(242) 



In this equation, p is the mean real-space number-density of objects (par- 
ticles). The factor in front of the integral in eq. ( |242|) takes account of the 
number of ways of interchanging labels among the objects within Pn; includ- 
ing this factor in the marginalising rule prevents the number of particles N 
from appearing explicitly in the equations. P n {l\ ■ ■ ■ 7n) is now the probability 
of finding any n objects simultaneously in the phase space elements 71 . . . 7„. 
Marginalising both sides of eq. ( [24 1|) according to this rule and integrating 
gives, fairly straightforwardly, 



d n 
Ul i=i 



_d_ 



E 



'■j 







n r 



ki 9 P 



n+l 



+1- 



(243) 



Details of the steps involved in this derivation are given in [486]; see also [19]. 
The equations fl243j ) - there are iV of them - are called the BBGKY equations 
and can be applied to any system evolving according to equation ( |238| ). The 
equations ( |243|) can also be derived from the Vlasov equation [f|6 



To apply the preceding formalism to the case of cosmological perturbations 



is not immediately straightforward because the system is expanding [440 



The effective LagrangiarQ for a particle moving in an expanding universe, in 
comoving coordinates, is: 



T L 2-2 

L = -ma x 



m0[x) 



(244) 



The canonical momentum is p = ma 2 x; if (f) = then p is conserved so that 
v = a(t)x oc l/a(t). The potential obeys Poisson's equation (^0|). We can now 
write down the BBGKY equations in an appropriate form: 



dP n 



+ Vi 



dP n Gm 2 
+ 



d"i n+1 — 



dP n+ i 



0. 



(245) 



at axi a J \Xi — x n +i\ &Vi 

where we implicitly sum over the index i, from 1 to n. 

25 This Lagrangian, in conjunction with the principle of least action, can be used to trace 
back galaxy motions and recover the initial configuration of, for example, the Local Group; 



see [446,35 44? 
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Now we have to worry about trying to close this hierarchy The approach 



of Davis and Peebles ||150|| , which we shall initially follow, was to solve for 
the first two moments of the distribution function after invoking a separation 
between velocity and position space and a hierarchical form for the three-point 
correlation function of the particle positions, as follows. First, we construct 
the first three moments of the velocity distribution after assuming that our 
system is statistically homogeneous, so that the one-point velocity distribution 
function, Pi can be taken to be independent of position: 

J d 3 vP x (v) = na 3 (246) 

Jd 3 vP 1 (v)v = (247) 

f t^tTPx^wV = -na 3 (vl)5 ij , (248) 
J 3 

where n is the mean number-density of particles and (vf) is the one-particle 
velocity dispersion; v i denotes the i-th component of the velocity. 

We now need integrals over the two- and three-point distributions, which we 
denote by P 2 (l, 2) and P 3 (l, 2, 3): 

J d 3 Vid 3 v 2 P 2 {l, 2) = {na 3 ) 2 {l + £ 12 ) (249) 

J d 3 v x d 3 v 2 d 3 v^{l, 2, 3) = (na 3 ) 3 (l + £ 12 + 63 + 63 + C123), (250) 

where £12 = £(a?i — x 2 ), etc. Finally, we need the first two pairwise relative 
velocity moments: 

>^ 3 ?7 2 P 2 (1,2K 2 = {na 3 )(v\ 2 ){l + £12) (251) 
d 3 v x d 3 v 2 v\ 2 v{ 2 = (na 3 ) 2 (v\ 2 v{ 2 )(l + £ 12 ), (252) 



where u 12 = v\ — v 2 . 

The above definitions are substituted into eq. ( [24 5| ) for n = 2 and we are left 
with three equations: 

l i + ^ [fe)(1+&2)H0; (253) 



-A\«2)(l + 62)] + -_^rl(v>L)(l + 62)] + 



a dt 
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(254) 



23, 
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l__d 

Gpa 



a 2 (v\ 2 v{ 2 )(l 
d 3 x 3 
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12, 
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+ - 



d 



a dx k 



12 



^12^2(1 + 6 



12, 



i(i +ei3K 3 -^(i + 63)^3 + ^2) 



+ 



X 



x 23 



= 0. (255) 



23, 



These are dynamical equations governing the evolution of the first few corre- 
lation functions and velocity moments. Notice that the average (P^v\ 2 ) is over 
the three-body phase space distribution: it is a three-body weighted pairwise 
velocity. 

The hierarchy, or at least the first few levels of it, are now in a more familiar 
language to cosmologists but we still have to introduce some argument in order 
to close it. We are helped by the assumption of large-scale homogeneity and 
isotropy; the resulting symmetries reduce the number of independent tensor 
components. We thus expect the mean pairwise streaming motion (v\ 2 ) = 
v(xi 2 )x\ 2 , i.e. aligned with the separation; the velocity dispersion can be split 
into longitudinal and transverse polarisations: 

(v\ 2 v{ 2 ) = £*^n(x 12 ) + {5 ij - x i x j )T l {x l2 ). (256) 

The notation here follows [|196|J467 l; beware of different definitions of S and 
II [150|! We now need to model the three-body weighted pair velocity, the 
three-point correlation function, £123 and the third velocity moment (v l v^v ). 

In the limit of weak correlations, one can merely ignore the higher-order 
correlations [ 440|| , or treat them term by term in a perturbative expansion 



198, 213, 33, 34 1 . This approach has led to many interesting results in the quasi- 



linear regime. 

When the correlations are strong, however, three-point correlations will be 
even larger in amplitude than two-point correlations, so we cannot use such 
a scheme. The Davis-Peebles approach [ |150]| is to close the hierarchy using 
a phenomenological model. From the observed behaviour of the three-point 
correlation function, it is reasonable to model £123 by the hierarchical form: 



C123 — Q (£12613 + 612623 + 6136: 



23 J 



(257) 



(see section 8.3). Indeed such a scaling may also be relevant for the correlation 
functions of even higher order, as we discuss later. 

To close the velocity moments is rather harder. Here we part company with 
150|| and follow the steps taken by P67|| . The former authors assumed, without 



much justification, that the skewness of the velocity distribution should van- 
ish. Less restrictive considerations lead to the following form for the required 
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average: 

= A(3vU + v^PxPS? + 5vE(xM J ' fe + + x fc cf j - 3^^'^). (258) 

(The Davis-Peebles form of this expression has A — B — 1). The more general 
form for the three-body weighted pairwise velocity is 

(P3^ 2 ) = ^ 2 (63+e23)+«23-^ei3+Qi^ 2 ei2(ei3+e 2 3)+(^L-43)ei3e23]- 

(259) 

(The analogous Davis-Peebles result has Q* = Q). Given this result one can 
actually determine A and B in terms of Q and Q*. Remarkably, it seems 
that Davis and Peebles got lucky in that the choice Q = Q* requires that 
A = B = 1. They obtained a consistent approximation, when they could 
quite easily not have done so. Substituting these forms into the dynamical 
equations ( |253|) , (§54]) & ( |255|) leads to four equations for the four unknowns 



£, v , II and E which we will not reproduce here; see [467] . 



In the strong clustering limit (£ 3> 1) one can simplify the equations consid- 
erably by taking a power-law form for the two-point correlation function as 
a solution of the first dynamical equation (253), which becomes in this limit: 



4a ~ (*) = ° (260) 

The solution is of the form £ ~ a^x -7 with (3 = 3 — 7 but 7 < 2 for the 
equations to converge. From this solution one finds E ~ II ~ a 2_7 x 2-7 . These 
functional forms are independent of A, B,Q,Q* . It turns out that B = 1 for 
consistency, but A is determined by 

so that A — B if and only if Q = Q*. Notice however that the power-law 
solution here exists for any value of 7. To obtain the final result, one assumes 
that self-similar clustering, which is the ansatz we have used to close the 
hierarchy, continues all the way from the linear regime. In the self-similar 
solution, we write £(x,t) = £(st), where s = x/t a for some power a. For an 
initial power-law power spectrum of index n, linear perturbation theory gives 
£ ~ t 4 / 3 a:~( 3+n - ) . If we assume that this solution matches onto one of the infinite 
number of scaling solutions in the strongly non-linear regime, we are left with 
an equation relating 7 and n: 



Obviously this treatment rests on a number of simplifying assumptions whose 
validity can be questioned [ |196| . p^ . ^42| . po 77 ^ . [ID] | . These results are neverthe- 
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less generally taken as confirming the assumption that gravitational cluster- 
ing evolves hierarchically in the non-linear regime. In our view, the issue 
of whether this is indeed a correct interpretation of the BBGKY analysis is 



far from settled. For a start, the analysis of ||485|| , suggests that £ should 
tend asympotically to a slope of 7 = 2 in the non-linear limit, independent 
of n. Numerical experiments using self-similar initial conditions apparently 
produce self-similar behaviour, but do not have power-law correlation func- 
tions ||176| . Simulations with non-self-similar initial conditions do not evolve 



self-similarly and have correlation functions whose slope varies with time; 
indeed the slope 7 seems quite happy to pass through 7 = 2 without caring 
|148|| . Hamilton |242| has further objected that cluster-cluster correlations and 



galaxy-galaxy correlations should be equal if the distribution were truly self- 
similar and so these considerations do not apply to the real Universe. These 
points seem to confirm the conclusion of | 467|| that the hierarchical solution 



of the BBGKY equations may be unstable to non-self-similar perturbations. 
Given that cosmologically-interesting primordial spectra are not usually of 
scale-free form one should be very cautious about using this type of analysis 
in anything other than qualitative arguments. 

As a final remark in this section, we point out that all the problems associated 
with the closure ansatz can be avoided if one deals directly with the Liouville 
equation for P^, rather than marginalising to the P n . One suggestion as to how 
to avoid the hierarchy thus induced is by dealing with the de Finetti generating 
function of the distributions, rather than the distributions themselves. The 
equation for the time evolution of this function turns out to be remarkably 
simple and may eventually lead to progress in this field, although it is a novel 



approach which has only just begun to be investigated [233 . 



6. 3. Thermodynamics 



Since the difficulties involved in climbing the BBGKY ladder seem so extreme, 
even for only the third rung, one is tempted to abandon this approach and 
try for an even more general description of the properties of self-gravitating 
material. 



In a number of papers, Saslaw and co-workers j4^ , p6jp^ , p8|M , [T3l . |49U| . |492| . pr3 



have developed a thermodynamic approach to galaxy clustering in which they 
concentrate upon the distribution function f{N), the probability^ of finding 
iV particles in a (randomly-positioned) spatial volume V. The assumptions 
involved in this theory are perhaps even more simplified than in the BBGKY 
closure problem but, to coin a phrase, the proof of the pudding is in the 
eating and the predictions of the theory do seem to be in good accord with 

26 We use f(N) and Pn(V) interchangeably to define the same physical quantity; see §8. 
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observations of galaxy counts. 



Basically, the theory assumes that a kind of gravitational quasi-equilibrium 
operates in galaxy clustering. Now, obviously, there is no thermodynamical 
equilibrium overall in a self-gravitating system; evolution to at least one sin- 
gularity seems to be the generic outcome. What one requires is that evolution 
of a clustering hierarchy should take place "slowly" enough that each succes- 
sive phase of the evolution is approximately described by an equilibrium law. 
Further objections can be raised, and some counter-arguments are discussed 
extensively in [491 1. A more careful derivation of the distribution, with slightly 



relaxed assumptions is given in ||490 | 



The main result of the theory is that the distribution function f(N) has the 
(unique) form: 



/(AO 



N{1 - 6) 

W\ 



N{1 - 6) + Nb] N 1 exp (-N(l - b) - Nb)\ . (263) 



This distribution has many interesting statistical properties |[488|| , including 
the fact that it is invariant under projection from 3D to 2D like the Poisson 
distribution. In eq. (|263|) , N is the mean number of objects in V; N — nV 
and b is a parameter which is the ratio of potential energy to kinetic energy 
in the system: b = —W/2K. For a Poisson distribution 6 = and 



/(AO 



N 



N 

-exp (-AT). 



(264) 



In linear theory b = 3/ 2 /4f2, which reduces to b = 3/4 for Qq — 1. In the fully 
relaxed state described by virial equilibrium, 6=1. Perhaps one of the reasons 
for the success of this theory is that the parameter 6 only changes slightly 
during the course of evolution and, although the appropriate 6 may well be 
different in more-evolved (i.e. denser) regions, the distribution averaged over 
the whole system still resembles eq. (|263|) . Some more detailed properties of 
the distribution ( |263| ) are discussed in §8.5 where we also discuss the properties 
of its moments. 



To make more detailed statements about 6 requires one to go back to both 



the BBGKY hierarchy (sec. 6.1) and use the cosmic energy equation [440]. On 
this basis, one can show that in a cosmological model where the scale factor 
grows as a(t) oc t a , the value of 6 should be given by 



2a + 1 
2a + 2' 



(265) 



which is in good accord with the observed value 6 = 0.70 ± 0.05, taken from 



the best fitting form of (|263| ) to the counts of galaxies in the Zwicky catalogue 
||137[ . More recently, Saslaw has shown that the time-evolution of 6 is described 
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by the relation: 



a(t) 



J) 1 ' 



b) 



-7/8 



(266) 



Some authors, e.g. [48S], have claimed that this is the behaviour followed 



by iV-body numerical simulations of self-gravitating systems, but this has 



been questioned [79|. There is evidence from the latest galaxy redshift surveys 
that f(N) is indeed well fitted by the form (|263 ); there are, however, other 
distribution functions, such as the Poisson-lognormal model ||127||, which fit 



the observations just as well 1515]. 



6.4- Fractals and Scaling 



The self-similar properties that seem to be implied by both observations and 
the BBGKY analysis described above, lead one naturally to a description of the 
mass distribution in the language of fractal sets [|360| , |361| , [r4| . The prevalence 
of techniques based on fractal geometry in fields such as condensed matter 
physics has led to an interest in applying these methods to the cosmological 
situation. 



To get a rough idea of the fractal description consider the mass contained in a 
sphere of radius r around a given galaxy. In the case where £(r) ^> 1 we have 



M(r) cx £(r)r 2 



(267) 



since £(r) has a power-law form with a slope of around 7 = 1.8 then we have 
that M(r) cx r L2 . In the language of fractals, this corresponds to a correlation 
dimension of D = 3 — 7 = 1.2. One can interpret this very simply by noting 
that if mass is distributed long one-dimensional structures (filaments) then 
M(r) oc r; two-dimensional sheets would have M cx r 2 and a space-filling 
homogeneous distribution would have M cx r 3 . A fractional dimension like 
that observed indicates a fractal structure like those discussed by Mandelbrot 



360,361 



Many attempts have been made to apply this kind of description to the ob- 
served universe [|438| . |2 19| . |r75| . |440||452| , |l 1 5| , |1 16]| . A completely fractal universe 
actually would cause a lot of problems for us, because it need not tend to ho- 
mogeneity on large scales and we would not therefore be able to do statistical 
cosmology! Fortunately, it seems that the real Universe is not described by a 
simple fractal of this kind [369]; a more sophisticated description is needed, 
in terms of multifractals ||423||74| . Nevertheless, simple scaling arguments can 
be useful for describing the behaviour of simple statistical properties of the 
matter distribution; see, for example, ||175| , [247|}431| . 



One can understand a multifractal model in terms of the simple argument 
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given below. Suppose that, around any individual point, the mass contained 
within r scales as M{r) oc r a , but that a varies from point to point: a = 
a(x). In a simple fractal - a monofractal - a would be independent of spatial 
position. If we make the assumption that the subset of points upon which 
a takes a given value is itself such a monofractal, then the set as a whole 
consists of a set of interwoven simple fractals of different dimensionality and 
is therefore termed a multifractal. This is therefore a generalisation of the 
concept of a fractal, but it is not completely general. The function f(a) defines 
the dimensionality of the sets of constant a: it must have only one maximum 
at a ; f{ a o) — Dq is the capacity dimension of the whole set (sometimes called 
the Hausdorff dimension or, more loosely, the fractal dimension) . The crucial 
property that a multifractal set possesses is that the scaling dimensions for the 
q— th moment of cell counts, the generalised dimensions, are not independent 
of q: 

D q = lim 1 lK >' J 268 

r— o(g-l) logr 

(g ^ 1; n>i(r) is the occupation number of the z-th cell of side r; N is the 
number of cells. An alternative definition is required for 5 = 1, but we shall 
not discuss this special case here. For a monofractal, D q = Do for all q. We 
shall discuss the methods for extracting D q from the data in Sec. 8.9; it is 
clear that D q ^ Dq for both data and simulations ||369| , |370| , |368jl and that the 



multifractal description provides a good qualitative model for the form of the 
clustering. 

It is perhaps helpful to consider a simplified two-dimensional model which 
demonstrates the difference between monofractal and multifractal distribu- 
tions. First, consider a square, divided into four equal square pieces. To each 
of the four sub-squares is assigned a number p t (i = 1, 2, 3, 4), where ^2pi = 1. 
Each sub-square is then divided in the same way as the original, and each 
subdivision is randomly assigned one of the Pi as above. This process is re- 
peated an arbitrarily large number of times. If it is repeated n times we have 
generated a 2 n x 2 n array of numbers, each one of which is a product of n of the 
Pi, randomly chosen. It is quite easy to show that the D q for this distribution 
can be written 

D q = (l- q)- 1 log 2 (p? +p q 2 +pl+ p\) , (269) 



for n — ► oo. The proof of this is given by Meakin |380fl . Now suppose p\ = 



and pi = p 2 = P3 = 1/3. In this case D q = D for all q and the model 
is a monofractal: each occupied cell has been produced by the same kind of 
scaling as all its neighbours. If the Pi are not degenerate, e.g p\ = 0.4, p 2 = 0.3, 
P3 = 0.2 and p^ = 0.1 then the D q are a function of q and the model must be a 
multifractal. One can see that each occupied cell at the lowest level is produced 
by a different kind of scaling, which depends on the particular assignment of 
a p at each stage of the hierarchy. 
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Although multifractal models can give a good insight into the scaling prop- 
erties of the galaxy distribution, there is as yet no connection between this 
description and the dynamical origin of clustering. For the time being such 
models must be regarded as purely phenomenological. Examples of multifrac- 
tal models based on hiearchical fragmentation have been discussed [ |370| . [279]| 



but the models involved are quite crude. One interesting point can be raised in 
connection with some of the previous discussion in this paper. As we shall see, 
the observed distribution of scaling indices corresponds roughly to a quadratic 
f(a) spectrum, at least around the maximum. Now f(a) is related to D q by 
a Legendre transform: 

&t t(q) 
r(q) = aq-f(a), a(q) = — , D q = 7~~[y- ( 2 ™) 

A parabolic f(a) is therefore equivalent to a linear dependence of D q upon q; if 
the f(a) degenerates to a 5-function then D q is constant, as discussed before. 
It can be shown that the former situation leads to a log-normal distribution of 
cell-counts ||279| ; this shows the consistency of the lognormal approximation 



127|| . More generally, multifractal sets can be described as spatially intermit- 
tent, being characterised by large- void regions and dense clusters phenomeno- 
logically like the observed distribution. It has been suggested that the word 



heterotopic should be used to describe clustering of this kind 279 . 



7. Numerical Methods 



We have demonstrated that many aspects of the structure formation process 
can be understood analytically, with the aid of the various approximation 
schemes discussed in previous sections. Nevertheless, many situations remain 
outside the reach of analytical methods and one is compelled to adopt a nu- 
merical approach to solve the equations of motion of the gravitating matter. In 
general, numerical simulations are also used to test the applicability of analyt- 
ical approximations under the assumption that they give the closest possible 
representation of the 'true' non-linear density field. Although this review is 
intended to cover the analytical schemes in most detail, we feel it is appropri- 
ate to give a brief description of the various numerical methods on offer. Part 
of the purpose of doing this is to remind the reader that these methods are 
themselves approximate, and one should not assume that numerical simula- 
tion techniques are therefore necessarily the best way to answer every possible 
question about the non-linear clustering pattern. 

Any numerical simulation will have limitations in terms of size and resolution. 
These limitations stem from unavoidable constraints on memory and computa- 
tional time. The size constraint means that one can only handle the evolution 
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of a relatively small, but hopefully 'representative', piece of the initial hyper- 
surface upon which the primordial density perturbations are laid down. One 
must therefore at least run an ensemble of simulations to assess the 'cosmic 
variance' in any statistic extracted from the final simulation configuration. It 
is not possible in a simulation to assess easily the effect of varying the initial 
power spectrum or fluctuation statistics: one has to run the simulation again 
from scratch for each choice. Despite these limitations, numerical simulations 
have led to considerable increases of our understanding of the clustering pro- 
cess over the past two decades. 

The literature already contains many excellent reviews of numerical techniques 
in cosmology, so we shall not go into great detail here about optimal param- 
eter choices, resolution effects and numerical artefacts. What we shall do is 
run briefly through the basic techniques and point the reader towards more 
detailed references where appropriate. 



7.1. Direct N-body Simulation 



The simplest way to compute the non-linear evolution of a cosmological fluid 
is to represent it as a discrete set of particles, and then sum the (pairwise) 
interactions between particles directly to calculate the Newtonian force on 
each one. With the adoption of a (small) timestep, one can use the resulting 
acceleration to update the particle velocity and then its position. The new po- 
sitions can then be used to re-calculate the interparticle forces, and so on. One 
should note at the outset that particle techniques are not actually intended to 
represent the motion of a discrete set of particles. The particle configuration 
is itself an approximation to a fluid. There is also a numerical problem with 
summation of interparticle forces: the Newtonian gravitational force between 
two particles increases as the particles approach each other and it is therefore 
necessary to choose an extremely small timestep to resolve the large velocity 
changes this induces. A very small timestep would require the consumption of 
enormous amounts of CPU time, so one usually avoids the problem by treat- 
ing each particle not as a point mass, but as an extended body. The practical 
upshot of this is that one modifies the Newtonian force law between particles: 

p Gm 2 {xj-Xj) 
{e z + \%i — Xj\ z ) ' 

where the particles are at positions x« and they all have the same mass m; the 
form of this equation avoids infinite forces at zero separations. The parameter 
e in eq. Q271[) is usually called the softening length and it acts to suppress 
two-body forces on small scales. This is equivalent to replacing point masses 
by extended bodies with a size of order e. Since we are not supposed to be 
dealing with the behaviour of a set of point masses anyway, the introduction 
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of a softening length is quite reasonable but it means one cannot trust the 
distribution of matter on scales of order e or less. 



If we suppose our simulation contains A particles, then the direct summation 
of all the A interactions to compute the acceleration of each particle requires 
a total of A(A — 1)/2 evaluations of ( 271 ) at each timestep. This is the crucial 
limitation of these methods: they tend to be very slow, with the timescale scal- 
ing roughly as A 2 . The maximum number of particles for which it is practical 
to use direct summation is of order 10000, which is not sufficient for realis- 
tic simulations of large-scale structure formation. This method was, however, 
used in the early days of structure formation studies P,p],|53| ; see also ||166|| . A 



standard direct A-body code, written by Sverre Aarseth, is published in [53 



7.2. Particle-Mesh Techniques 



The usual method for improving upon direct A-body summation for com- 
puting inter-particle forces is some form of 'particle-mesh' (PM) scheme. In 
this scheme the forces are solved by assiging mass points to a regular grid and 
then solving Poisson's equation on the grid. The use of a regular grid allows 
one to use Fourier transform methods to recover the potential which leads to 
a considerable increase in speed. A comprehensive account of PM methods 
is given in the standard textbook on particle methods ||259|| and cosmological 



applications are described by ||174j| and ||1 73|| . We shall merely give an outline 



here, following | 173 |. The basic steps in a PM calculation are as follows. 



In the following, n is a vector representing a grid position (the three compo- 
nents of n are integers); is the location of the i-th particle in the simulation 
volume; for simplicity we adopt a notation such that G = 1, the length of the 
side of the simulation cube is unity and the total mass is also unity; M will 
be the number of mesh-cells along one side of the simulation cube the total 
number of cells being A; the vector q = n/M. First we assign mass points to 
the grid: 

M 3 N 

P® = -JT E^rS), (272) 

i=l 

where W defines a weighting scheme designed to assign mass to the mesh 
scheme. We then calculate the potential by summing over the mesh 

^ = ^ES(?-^)^: ( 273 ) 

compute the resulting forces at the grid points, 

F(q) = ~3<f>; (274) 
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and then interpolate to find the forces on each particle, 



F(x t )=Y.W(x t -q)F(q). (275) 



In equation ( f274| ), D is a finite differencing scheme used to derive the forces 
from the potential. We shall not go into the various possible choices of weight- 
ing function W in this brief treatment: possibilities include 'nearest gridpoint' 
(NGP), 'cloud-in-cell" (CIC) and 'triangular shaped clouds' (TSC). Each has 
pros and cons which are described by [ 173 ] , who also discuss possible choices of 



finite-differencing scheme for eq. Q274T ). For a general text on particle methods, 
see 



The most important advantage of the PM method is that it allows a fast 
summation of the forces. The potential on the grid can be written 



0(7, m,n) = J2 &(P> <?, r )f>(Pi <?, r) exp 



p,q,r 



2m. , . 
— - (pi + qm + rn) 



(276) 



where the 'hats' denote Fourier transforms of the relevant mesh quantities. 
There are different possibilities for the Green's function Q, the most straight- 
forward being simply 

G(p, q, r) = ~\ (277) 

7i (jr + q l + r z ) 

unless p = q = r = 0, in which case Q = 0; alternative, more complicated, 
forms for the Green's function are possible [ |173|| . Equation ( |276| ) represents 
a sum, rather than the convolution in equation ( |273| ), and it's evaluation 
can therefore be performed much more quickly. A Fast-Fourier Transform is 
basically a processs which is of order N log N in the number of grid points 
and this represents a substantial improvement for large iV over the direct 
summation technique. The price to be paid for this is that the usual Fourier 
summation method implicitly requires that the simulation box have periodic 
boundary conditions. This is probably the most reasonable choice of boundary 
conditions for simulating a 'representative' part of the Universe. Methods do 
exist, however, for handling other types of boundary, such as isolated ones, 
and one can add an exterior potential by linear superposition. 

The potential weakness with this method is the comparatively poor force 
resolution on small scales because of the finite spatial size of the mesh. A 
substantial increase in spatial resolution can be achieved by using instead the 
'particle-particle-particle mesh' method (P 3 M), also described in detail by 
|1 74| . |259| , |3 1 1| , |1 73|j46H . Here, the short-range resolution of the algorithm is im- 



proved by adding a correction to the mesh force. This contribution is obtained 
by summing directly all the force contributions from neighbours within some 
fixed distance r s of each particle. A typical choice for r s will be around 3 grid 
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units. Alternatively, one can use a modified force law on these small scales to 
assign any particular density profile to the particles, similar to the softening 
procedure demonstrated in equation ( |271| ). This part of the force calculation 
may well be quite slow, so it is advantageous merely to calculate the short- 
range force at the start for a large number of points spaced linearly in radius, 
and then find the actual force by simple interpolation. The long-range part of 
the force calculation is done by a variant of the PM method described earlier. 

One of the problems with straightforward P 3 M is that when clustering is 
highly developed, many particles in dense regions have separations < r^. This 
means that the algorithm spends much of its time calculating particle-particle 
forces directly and the saving introduced by using the mesh potential can be 
lost. An ingenious resolution of this problem was suggested by Couchman 



136|| . This idea involves using sub-grids to speed up the direct summation of 
the short range forces. Essentially, one zooms in on a small part of the original 
simulation grid and uses a finer mesh to apply a PM calculation within this 
part. The algorithm can speed up a large iV-body simulation by a factor of 



100 or more |[46|| ; it also allows a fully parallel implementation. Using these 
methods it is feasible to run simulations^] of 256 3 particles on a 512 3 mesh 
within a reasonable time, if one has access to a supercomputer. Even a fairly 
ordinary desktop workstation can easily cope with 128 3 particle simulations. 

Variants of the PM and P 3 M technique are now the standard workhorses for 
cosmological clustering studies. Different workers have slightly different inter- 
polation schemes and choices of softening length, etc. Whether one should 
use PM or P 3 M in general depends upon the degree of clustering one wishes 
to probe. Strongly non-linear clustering in dense environments probably re- 
quires the force resolution of P 3 M. For larger scale structure analyses where 
one does not attempt to probe the inner structure of highly condensed ob- 
jects, PM is probably good enough. One should, however, recognise that the 
short-range forces are not computed exactly, even in P 3 M, so the apparent 
extra resolution may not necessarily be saying anything physical. Moreover, 
both P 3 M and the tree codes we discuss below handle fewer particles than 
PM for the same investment of CPU time, leading to poorer mass resolution 
and greater discreteness effects. As Melott has pointed out, "More resolution 



isn't necessarily better resolution" ||384|| . Nevertheless, in situations where one 
needs to follow the evolution of perturbations into the strongly non-linear 
regime, for instance when one is looking at the formation and evolution of in- 
dividual galaxies or clusters, the P 3 M algorithm may represent a substantial 
improvement over plain PM. 



27 Notice that the FFT algorithm is optimal if the number of mesh points is a power of 
two, so most simulations involve parameters obeying this choice. 



Ill 



7.3. Tree Codes 



An alternative procedure for enhancing the force resolution of a particle code 
whilst keeping the necessary CPU demands within reasonable limits is to 
adopt an hierarchical subdivision procedure. The generic name given to this 
kind of technique is 'tree codes'. The basic idea is to treat distant clumps of 
particles as single massive pseudo-particles. The first application of this kind 
of method to cosmology was suggested by Barnes & Hut |24]] whose algorithm 
involves a mesh which is divided into cells hierarchically such that every cell 
which contains more than one particle is divided into 2 3 sub-cells. If any of the 
resulting sub-cells contains more than one particle, that cell is sub-divided 
again. There are some subtleties involved with communicating particle posi- 
tions up and down the resulting 'tree', but it is basically quite straightforward 
to treat the distant forces using the coarsely-grained distribution contained 
in the high level of the tree, while short-range forces use the finer grid. The 
problem with such codes is that, as well as running slower than P 3 M for the 
same resolution, they do require substantially greater memory resources. Their 
use in cosmological contexts has so far therefore been quite limited, as mem- 
ory is often the limiting factor for simulation work, rather than CPU speed. 
More details of the procedure, as well as various tests and a comparison with 
particle-mesh techniques are given by p4|,|76|,p33|,[481||. 



There has been a growing effort in recent years to parallelise N-body codes 
in order to take advantage of the massively parallel machines such as the 
CM-5 and the CRAY T3D. These methods distribute the computational load 
evenly amongst N-processors (where N can be as large as 1024 for the CM5). 



A discussion of parallel tree codes can be found in [480,553 . 



7.4- Hydrodynamical Methods 



In most of this section, and indeed most of this article, we have concentrated 
upon schemes for evolving the purely gravitational interactions of the distribu- 
tion of matter. This may be reasonable in structure formation scenarios where 
the dominant contribution is from collisionless dark matter, but in strongly 
non-linear structures one expects pressure forces due to the baryonic compo- 
nent to become dynamically important, perhaps resulting in shock formation 
and dissipation. Moreover, one ideally wants to know about cooling and star 
formation in collapsing structures to have some idea of how to relate the dis- 
tribution of material with the distribution of luminosity revealed in galaxy 
catalogues. The field of cosmological hydrodynamics is very much in its in- 
fancy, and it is fair to say that there are no analytic approximations that can 
be implemented with great confidence in this kind of analysis, though some 
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phenomenological models of galaxy formation have been suggested based on 
analytical considerations 1 1565| , 110| , 328 J327 , 302 , pQ3| , 11 1 , 300 1- For the most part, 
however, hydrodynamics requires a numerical approach; we shall briefly review 
the two main methodologies in this section. 



7.4-1- Smoothed Particle Hydrodynamics 

In smoothed-particle hydrodynamics (SPH) one typically represents the fluid 
as a set of particles in the same way as in the iV-body gravitational simulations 
described in §7.1-7.3. Densities and gas forces at particle locations are thus 
calculated by summing pairwise forces between particles. Since pressure forces 
are expected to fall off rapidly with separation, above some smoothing scale 
h (see below) it is reasonable to insert the gas dynamics into the part of a 
particle code that details the short-range forces, i.e. the particle-particle part 
of a P 3 M code. This makes it natural to insert SPH dynamics into a P 3 M 
code, as suggested by Evrard | |185| ] in the following manner. 



One technique used to determine local densities and pressure gradients is 
known as kernel estimation. This is essentially equivalent to convolving a field 
f(x) with a filter function W to produce a smoothed version of the field: 



f.{r) = J d 3 xf(x)W(x-r), (278) 



where W contains some implicit smoothing scale h; one possible choice is a 
Gaussian. If f(x) is just the density field arising from the discrete distribu- 
tion of particles then it can be represented simply as the sum of 5-function 



contributions at each particle location and one recovers equation (272). We 
need to represent the pressure forces in the Euler equation (|T6|): this is done 
by specifying the equation of state of the fluid P = (7 — l)ep, where e is the 
thermal energy and p the local density and P the pressure. Now one can write 
the pressure force term in equation 



VP 





as 






«( 










(279) 



P 

Now the gradient of the smoothed function f s can be written 

V/,(f) = / d 3 xf(x)VW(x-r), (280) 
so that the gas forces can be obtained in the form 



/V/M «-£(§ + §Wfo). (281) 



V p j t y\pt pv 
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The form of equation (|281|) guarantees conservation of linear and angular mo- 
mentum when a spherically symmetric kernel W is used. (The smoothing scale 



h need not be constant [118511, but care must be taken to include derivatives 



of h in a self-consistent manner to be sure the equations are conservative 



409 1 .) The adiabatic change in the internal energy of the gas can similarly be 



calculated: 



^(DEV^)-^ (282) 



where Vij is the relative velocity between particles. For collisions at a high 
Mach number, thermal pressure will not prevent the particles from streaming 
freely, but in real gases there is molecular viscosity which prevents penetration 
of gas clouds. This is modelled in the simulations by introducing a numerical 
viscosity, the optimal form of which depends upon the nature of the simulation 
being attempted; see the discussion in ||185||. 



It is also possible to implement SPH on the back of a gravitational tree code, as 
described in §7.3. A substantial study by Hernquist & Katz ||257|| has recently 



been devoted to such methods so we shall not describe them in more detail 
here. Cosmological applications of SPH methods and their variants can be 
found in p95|TOJ296|J408] , |298| , |l86| . Note also that tree-SPH methods have 



been used extensively in stellar collapse problems, e.g. [40S . 



7.4-2. Eulerian Hydrodynamics 

The advantage of particle-based methods is that they are Lagrangian and con- 
sequently follow the motion of the fluid. In practical terms, this means that 
most of the computing effort is directed towards places where most of the 
particles are, and therefore, where most resolution is required. As mentioned 
above, particle methods are the standard numerical tool for cosmological sim- 
ulations. 

Classical fluid dynamics, on the other hand, has usually followed an Eulerian 
approach where one uses a fixed (or perhaps adaptive) mesh. Codes have been 
developed which conserve flux and which integrate the Eulerian equations of 
motion rapidly and accurately using various finite-difference approximation 
schemes. It has even proved possible to introduce methods for tracking the 
behaviour of shocks accurately - something which particle codes ofted struggle 
to achieve. Two examples of Eulerian codes used for cosmology are detailed in 
106|| and |[4T0|| . Typically, these codes can treat many more cells than an SPH 



code can treat particles, but the resolution is usually not so good in some 
regions because the cells will usually be equally spaced rather than being 
concentrated in the interesting high-density regions. We refer the reader to 
the above references for details of these codes; we have no space to describe 
them here. 
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An extensive comparison between Eulerian and Lagrangian hydrodynamical 
methods has recently been performed in [292], which we recommend to anyone 
thinking of applying these techniques in a cosmological context. Each has its 
advantages and disadvantages. Density resolution is usually better in state- 
of-the-art Lagrangian codes, except in low-density regions where the number 
of particles is so small that statistical effects dominate. On the other hand, 
Eulerian codes generally resolve the temperature much better, except in very 
high density regions where the numerical viscosity is artificially large. 



7.5. Initial Conditions and Boundary Effects 



To complete this section, we make a few brief remarks about starting condi- 
tions for iV-body simulations, and the effect of boundaries and resolution on 
the final results. 

Firstly, one needs to be able to set up the initial conditions for a numeri- 
cal simulation in a manner appropriate to the cosmological scenario under 
consideration. For most models this means making a random-phase realisa- 
tion^ of the power-spectrum - see §2.4. This is usually achieved by setting 
up particles initially exactly on the grid positions, then using the Zel'dovich 
approximation, equation (|159|) , to move them such as to create a density field 
with the required spectrum and statistics. The initial velocity field is likewise 
obtained from the primordial gravitational potential. This procedure is quite 
straightforward and is discussed, for example, in ||166| , |173|| . One should be- 



ware, however, the effects of the poor /c-space resolution at long wavelengths. 
The assignment of fc-space amplitudes requires a random amplitude for each 
wave vector contained in the reciprocal-space version of the initial grid. As 
the wave number decreases, the discrete nature of the grid becomes apparent. 
For example, there are only three (orthogonal) wave vectors associated with 
the fundamental mode of the box. When amplitudes are assigned via some 
random-number generator, one must take care that the statistically poor sam- 
pling of fc-space does not lead to spurious features in the initial conditions. 
One should use a simulation box which is rather larger than the maximum 
scale at which there is significant power in the initial power-spectrum. At the 
opposite end of the resolution question is the finite spacing of the grid; this 
means that one should not trust structure on scales corresponding to greater 
than the Nyquist frequency of the initial grid, as these cannot be resolved by 
the mesh. Indeed, the resolution will be determined by whichever is the higher 
of the particle and mesh Nyquist frequencies. 

Related to this warning is the role of boundary effects on the simulation. Recall 
that the particle-mesh methods require the imposition of periodic boundary 



2N 



Non-Gaussian initial conditions are discussed by, for example, |400[|557| . 
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conditions, which may also distort the structure on scales of order the simula- 
tion box. An ingenuous test of the effect of periodic boundary conditions was 
recently devised by Farrar & Melott |[189 | who imposed the topology of a Klein 
bottle upon the simulation box to see how much this altered the results with 
respect to the usual situation of a cube with opposite faces identified. The 
results suggest that boundary effects are minimal for the kind of simulation 
parameters used in most cosmological studies. 



8. Statistical Description of Galaxy Clustering 



To put our theoretical considerations into context we shall, in this section, 
give a concise review of the observed properties of the galaxy and cluster dis- 
tributions. We also give a fairly complete discussion of the various statistical 
descriptors used to characterise the essential features of the clustering pat- 
tern. Of course, different approximations will reproduce properties of the fully 
non-linear density field with varying degrees of accuracy. As we have seen 
before, in §4.2 and §4.7, techniques like the Zel'dovich approximation should 
reproduce well the large-scale geometry and topology of the distribution, but 
might be less accurate at matching 'hard' statistical properties like the corre- 
lation functions and cell-counts. Although we have been able in this review 
to explain how some approximations lead to estimates of low-order moments 
and correlation functions, their behaviour in terms of more sophisticated sta- 



tistical properties has only recently begun to be investigated ||493|| . We review 



these methods here to give some idea of the work still to be done. 

Let us, however, lodge an important reservation about all the statistical meth- 
ods that utilise only positional (and not velocity) information about galaxies. 
As we mentioned in §2.4, we do not know how to relate galaxy counts to mass 
fluctuations. So when we extract statistical parameters from a galaxy survey 
there is no unique way to relate that to parameters which we can calculate 
theoretically which are sensitive to the total mass. There could well exist a 
complex bias ||286| , |157| , p90| , |83|jT22|1 which intervenes in such a way that galaxy 



statistics are a gross distortion of the underlying mass distribution. In §8.10 
we discuss recent attempts to use the other half of phase space, peculiar veloc- 
ities, to try to probe the full mass distribution, but as we shall see, attempts 
to do this are in their infancy and the data are meagre. 

We begin by briefly reviewing the kinds of cosmological clustering data that 
are available at the present. 
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8.1. Galaxy Surveys 



Although the clustering of galaxies was observed empirically in the days of 
John Herschel, it is only in the last few decades that it has been possible to 
construct complete and well-controlled samples of galaxies with well-defined 
selection criteria. It is only more recently still that we have been able to build 
such surveys in three dimensions, using breakthroughs in instrumentation. 
Here we give a brief historical approach to the most recent developments in 
this field. 



8.1.1. Selection Functions 

Let us first define some simple terms. Since galaxies do not have the same 
intrinsic brightness, different objects can be seen at different distances from 
the observer with a detection method of a fixed flux (apparent luminosity) 
sensitivity. If n(L) is the number of galaxies per unit luminosity, then n(L) dL 
is the number in the luminosity range (L, L + dL). If the sample is complete 
over some volume, then the Luminosity Function (LF) is defined by the num- 
ber of galaxies per unit luminosity per unit volume: dL = n(L) dL/V s , 
where V s is the sample volume. If the luminosity function is universal, then 
$(L) is not a function of position and will be the same functional form for any 
sample selection criteria. This indeed appears to be the case and the correct 
form is given by: 

m dL = exp (--j±) d (£) , (283) 



which is called the Schechter luminosity function ||500|| . Typical values for the 
parameters are a ~ —1.25 and L* ~ 10 10 L Q ; $* is an overall normalisation 
constant, giving the number-density of galaxies in the sample. 

In general, of course, we cannot construct a sample consisting of every galaxy 
within a given volume. The usual method is to select all galaxies above some 
fixed flux limit. If this apparent luminosity limit is I, then one can easily 
calculate the intrinsic luminosity L(l,r) which a galaxy would have to have 
at distance r to be just at the threshold of inclusion in the sample. Let us 
put this in dimensionless form by defining x(r) to be this limiting magnitude, 
x(r) = L(r)/L* and to be the dimensionless luminosity of the faintest 
source included in the survey at any distance. The selection function 0(r) is 
then 



^284) 



This equation ( [284] ) is appropriate for optical galaxies; IRAS galaxies, for 
example, have a different luminosity function, with a much shallower high- 
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luminosity cutoff than this and so can be used to sample comparatively larger 
scales [333]. Because some galaxies are missed if one deals with a magnitude 
limited survey, some statistical correction must be applied to galaxies at large 
radial distances from the observer. This generally involves weighting galaxies 
at large r by 

f(r) = ^- (285) 

but different weightings are appropriate for different statistical treatments, 
such as when one wishes to construct a minimum variance estimator for some 
statistic. 



8.1.2. Projected Catalogues 



Before the availability of redshifts for large samples of galaxies, the main 
source of clustering was two-dimensional information about galaxy positions 
on the celestial sphere. Historically important catalogues of this kind were the 
Zwicky and Jagellonian sky surveys. A dramatic improvement in the quality 
and quantity of data was achieved with the Lick galaxy catalogue, in the 
form of counts of galaxies in 10 x 10 arcminute cells compiled by Shane & 
Wirtanen ||510|| . Although over 25 years old, this catalogue, which contains 
approximately 10 6 galaxies, is still yielding new and important constraints on 
models of galaxy formation and clustering, e.g. ||131| . 



Doubts have been raised about the Shane & Wirtanen and earlier catalogues, 
on the basis of the relatively crude visual methods used to count the galaxies 



131, 440, 25 1 . More recent catalogues of this kind are compiled using automated 



plate-scanning devices such as APM [ |356|| and COSMOS |[132|| . The enormous 
numbers of galaxies that can be counted above the flux limit used in these 
surveys to some extent compensates for the lack of redshift information, and 
these two-dimensional catalogues are still yielding important constraints on 
structure formation scenarios. 



8.1.3. Redshift Surveys 



Given the comparatively long time needed to obtain the spectrum of a faint 
galaxy, one needs to plan observational strategies carefully in order to obtain 
a well-controlled sample of galaxy redshifts. For a general review of the art of 
the redshift survey, see |[252| . 



One method is to examine only a small area of the sky, but with a very low 
limiting flux density. According to this approach, one obtains a narrow cone or 
pencil-beam shaped survey volume. Prominent examples of the use of this kind 
of survey are the void searches of Kirschner et al. ||309|| , and the north-south 
galactic pole pencil beam analysis of [8~7|. Here, of course, the information one 
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obtains is effectively one-dimensional. 



Another possible strategy is to look at a larger angle, but to a somewhat 
higher limiting flux density. This can be achieved by looking at a narrow an- 
gular strip as favoured by the Cf A survey program ||159|J160| . |555|| . These surveys 
produce data which is effectively almost two-dimensional, usually represented 
in the form of a wedge plot. Surveys covering a larger area of the sky, and 
thus generating truly three-dimensional samples include the Perseus-Pisces 
sample ||253| , the local supercluster survey ||540fl , the Southern Sky Redshift 



Survey (SSRS) ||142| | and the large-area CfA survey | |149[ | . The combined up- 
dated SSRS and CfA surveys include over 15000 galaxies (to magnitude limit 
m s(o) = 15.5) and cover over 3 steradians of the sky |[L43|| . Looking ahead, the 
upcoming Sloan survey is underway which should provide redshifts of around 
a million galaxies with mg > 19.5 by the end of the century. 

Of course, with a terrestrial telescope, one is always constrained to cover that 
part of the sky accessible from a given site. Not so with a satellite. The IRAS 
satellite detected a great number of extragalactic sources, yielding a parent 
all-sky catalogue containing approximately 12000 galaxies. Follow-up redshift 
surveys, include a complete survey down to a limiting flux of 2 Jansky at 60 
micron [528], and a randomly-selected one-in-six subset of these galaxies down 
to 0.6 Jansky ||333|| . The latter of these, named QDOT by the collaboration, 
is one of the richest sources of cosmological data. The shallow fall-off of the 
luminosity function of IRAS galaxies means that the the survey probes scales 
> lOOh- 1 Mpc. 



8.1.4- Rich Clusters 

One of the most striking features of the distribution of galaxies, obvious even 
without detailed statistical analysis, is the existence of extremely dense objects 
like the Coma cluster. Such objects contain many hundreds of galaxies in a 
volume where, on average, one would expect only a few if galaxies were not 
clustered. The most natural assumption to make is that these "rich" clusters 
of galaxies are structures which are fully (or almost fully-) collapsed objects. 
Since the typical mass of these objects is of order 10 15 M Q , they represent a 
much larger mass and length scale than galaxy samples can probe. 

The first systematic survey of the properties of rich clusters was performed 
by Abell J3|. He made a catalogue of the clusters he could see in projection 
on the sky by counting objects inside an 'Abell" radius (approximately 1.5 
Mpc). The clusters he detected were ranked into 'richness classes' according 
to how many bright galaxies were within the Abell radius; he also assigned 
clusters into 'distance classes' according to an estimated distance. The original 
catalogue contained 2712 clusters in the redshift range 0.02 < z < 0.2 in the 
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northern hemisphere; an extension to the southern hemisphere was published 
after Abell's death ||. The spatial density of Abell clusters decreases rapidly 
with increasing richness, the mean density of R > 1 clusters is ~ 10~ 5 h 3 
Mpc" 3 . 

As we shall see later, clusters are themselves clustered, so that Abell clusters 
might be expected to trace structure on very large scales-^]. The problem is, 
however, that rich clusters are defined in projection on the sky and, since the 
Abell radius is quite large, it is possible that many chance superpositions of 
galaxies might occur, leading to contamination of the catalogue especially in 
the poorer clusters. This, together, with the fact that Abell's catalogue was 
compiled by a relatively primitive eyeball technique, has led many workers 
to be suspicious that the structures traced by the rich clusters are merely an 
artifact of the selection procedure. A fuller list of references about the possible 
effect of contamination on the statistics of superclustering is given in §8.2. 
More recently, automated plate-scanning devices such as APM and COSMOS 
have been used to define cluster samples using similar criteria to those that 
Abell used, but with a smaller effective 'Abell' radius to reduce the possibility 
of projection contamination ||145| , |172| , |410|1 . This catalogue covers, however, a 



much smaller part of the sky than the combined Abell/ACO sample. 

To construct a three-dimensional cluster catalogue, one must proceed from a 
two-dimensional parent catalogue in a manner similar to the procedure for a 
galaxy redshift survey. The main difference is that one really needs too many 
galaxy redshifts in each cluster because cluster galaxies have large peculiar 
motions and, as discussed above, some of the galaxies may be line-of-sight 
interlopers at a different distance from the bulk of the cluster. The large num- 
ber of redshifts required means that relatively few clusters have published 
redshifts. The largest compilation of Abell cluster redshifts is published in 



529|| , but there are several erroneous redshifts and some are based on only 
one cluster galaxy. It is also possible to estimate redshifts using the known 
correlation between mio (the apparent magnitude of the tenth-brightest clus- 



ter galaxy) and distance. The available data sources are listed in |[435|| , which 
also contains a discussion of the procedure for estimating redshifts. To avoid 
projection contamination altogether, one might use clusters selected in X-ray 
surveys, because rich clusters contain hot gas whose temperature should fur- 
nish an estimate of the virial mass of the cluster [57] . The use of X-ray selected 



clusters for cosmological studies is, however, in its infancy and we shall not 
discuss it further in this paper. 



29 A less restrictive selection criteria for clusters was employed by Zwicky in his catalogue 
of ~ 9, 700 clusters. Zwicky's clusters are often of lower density than Abell's and sometimes 
contain two or more subclumps within themselves [ 

30 Clusters of clusters are usually called 'superclusters', but superclusters are much less well- 
defined objects than Abell clusters, because they correspond to a much smaller overdensity; 
S ~ 1 for superclusters, whereas S > 100 for clusters. 



583 1; for a review see p 
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8.1.5. Peculiar Motions 



As we remarked earlier, galaxy positions probe only half the relevant phase- 
space needed to fully specify the dynamical behaviour of the mass distribution. 
Peculiar velocities not only provide an insight into the other half, but are 
sensitive to the total amount of mass present. One might expect them therefore 
to provide some information about the amount of bias present. The amount 
of information available is, however, quite small and the errors are difficult to 
control. One first needs a sample of galaxies with measured redshifts z. One 
then needs to measure the distance to the galaxies by some means which is 
independent of the redshift. This typically means the Tully-Fisher relationship 
for spiral galaxies and the Faber-Jackson relationship for elliptical galaxies 
||187] , |354| , p8| . The peculiar motion of the galaxy in the radial direction from 
the observer is then 

v p {f) =cz- H r. (286) 

Galaxies are just assumed to be tracers of the velocity flow in this analysis so 
one does not need to worry too much about the possible presence of a bias. 

The problem with these studies is managing to find distances to a sufficient 
accuracy. While redshifts are quite easy to measure to very high accuracy, 
distances can be obtained directly to no better than 10-20%. What is worse is 
that one uses the continuity equation to relate velocities to mass (at this level 
one does have to worry about the bias), and any systematic dependence of 
the performance of the distance indicator upon the local density will generate 
spurious peculiar velocities in dense environments and thus throw the whole 
picture into confusion. There is indeed some evidence of this kind of effect 



241 1 so one should interpret velocity field studies very carefully. We discuss 



some of the methods of analysis in §8.10. 



8. 2. The two-point correlation function 



For many years the standard tool for quantifying the clustering of extra- 
galactic objects has been the two-point correlation function, denoted £(r) 
PIPOIPWTI . This is defined by 



dP 2 = n 2 [1 + £(r)] dVxdVs, (287) 

where dP 2 is the (infinitesimal) probability of finding two objects in the (in- 
finitesimal) volumes dVi and dV 2 ; n is the mean number-density of objects. 
We have implicitly assumed in ( |287| ) that the correlation function depends only 
upon the separation of the two volume elements, i.e. that the distribution is 
statistically homogeneous and isotropic. It is obvious from the definition that 
£(r) represents the excess probability of finding an object at a distance r from 
a given object, compared to a random (Poisson) distribution. This definition 
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differs by a factor n from the quantities used in condensed matter physics 



581| . |452|| . Notice that this statistic need not be applied only to galaxies, one 
could also calculate the correlation function of clusters, quasars, radio-galaxies 
or anything that can be represented by a point process. 

Of course, this definition is useful only if we have three-dimensional informa- 
tion about the positions of the galaxies. Fortunately, the analogous definition 
of the two-point correlation function in projection on the sky also turns out 
to be useful: 

dP 2 = 4 [1 + w(0)] dfii dn 2 . (288) 

In this case we are looking at the excess probability of finding a galaxy at 
an angle 9 from a given galaxy, in an obvious notation. The usefulness of this 
description is that, if the luminosity function of galaxies does not depend upon 



their position, then one can use the Limber equations ||342|| to deproject the 
distribution and obtain £(r) from w(9). This means that one can make use of 
the much larger projected catalogues to determine the real-space correlation 
function. 

The usual method for estimating £(r) uses a random Poisson point process 
generated with the same sample boundary and selection function as the real 
data; one can then estimate £ according to 

t, N DD(r) , . 

1+((r) ^im- (289) 

or 

DD(r) 

l + = (290) 

where DD(r), RR(r) and DR(r) are the number of pairs with separation r 
in the actual data catalogue, in the random catalogue and with one member 
in the data and one in the random catalogue, respectively. We have assumed 
for simplicity that the real and random catalogues have the same number of 
points (which they need not). The second of these estimators is more robust 
to boundary effects (e.g. if a cluster lies near the edge of the survey region), 
but they both give the same result for large samples. A thorough discussion 
of these and other estimators is given in [ 245] . 



There has been some difficulty in the past in knowing how to assign errors to 
the estimation of £(r) from a finite sample; a recent innovation has been the 
use of the bootstrap technique to estimate more realistic errors f26| , |344|| than 



the usual quasi-Poisson approach ||440|1 , but this is still not perfect. 



These definitions apply to a discrete set of points distributed in either two 
or three dimensions. In the previous discussion of density fields we have been 
dealing with a continuous density field and this description is not appropriate. 
For a continuous density field the analogous quantity is the autocovariance 
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function: 

(5(xi)5(f 2 )) = ^(1^1 - x 2 \) = f (r), (291) 

assuming statistical homogeneity and isotropy. In the case where galaxies oc- 
cur randomly with a probability per unit volume proportional to (1 + 5) then 
the form of £ is the same for both continuous and discrete definitions. If there 
is a strong bias, however, then this need not be the case | |126| , |122| | . Observation- 
ally, it is well-established that the two-point correlation function of optical 
galaxies has the power-law formP 1 ]: 



£gg( r ) = r o, 9 - Mpc, (292) 




with 7 = 1.8 for r < 10h~ l Mpc [|151| , |512j| . On larger scales the real-space 
correlation function is hard to measure from redshift surveys because of the 
limited size, so one can use the angular correlation function estimate (|288| ) 
and the Limber equation. The results for w(6) show a power-law slope on 
small scales (with a slope 1 — 7, expected from the properties of the Limber 
equation). On larger angular scales, an analysis of the Lick map suggested a 



break in the power-law and comparatively little large-scale clustering ||440 
More recently, however, a similar analysis of the APM and COSMOS maps 
have suggested a much larger amplitude of £ (r) on large scales, incompatible, 
for example, with the predictions of the standard CDM model [|356| , |132jj337|| . 
A direct determination of the large-scale correlation function of IRAS galaxies 
also indicates that the slope is different from that quoted for bright optical 
galaxies on similar scales ||368| 



The correlation function measured in redshift space £ r can differ significantly 
from its value in real space £ because of the effect of peculiar velocities of 



galaxies [ 287 |. In the estimates of the two-point correlation function on small 
scales, many of the pairs with small r belong to rich clusters which have 
large intrinsic velocity dispersions. Redshift and distance are therefore not 
equivalent for these galaxies. In terms of the linear growth factor /, defined 
by eq. fl22|), the mapping between redshift space and real space introduces 
a multiplicative error in the radial direction in the linear regime: £ r (f) — 
C(Q ){(r), where C(fi ) = 1 + (2//3) + (f 2 /5), and / is the usual growth 
factor /(O ) — ^0 6 - ft * s possible to use the observed anisotropy in redshift 
space to estimate / and thus Q ||243| , |244|| , but this linear analysis requires one 
to be looking at very large scales. 

It is worth also mentioning the properties of the rich cluster distribution. Rich 
clusters seem to have a correlation function which is of the same form as equa- 
tion ( |292| ), but with a different correlation length: r 0jC — 12 — 25h~ 1 Mpc. The 

31 Galaxies selected from the IRAS sample have a somewhat smaller correlation length and 



gentler slope r , 9 ~ 3.79/i _1 Mpc, 7 ~ 1.57 [525,194] 
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exact value of this is controversial, because of arguments about possible projec- 
tion contamination in the cluster catalogues |l7| , |460| , |53ll[i^|T56t[418|J532( | , but 
it certainly has a higher amplitude than the galaxy correlation function. In- 
deed, recent surveys of objects of varying richness such as galaxy groups, Abell 
clusters, X-ray clusters, superclusters and so on indicate that richer (and con- 
sequently rarer) objects exhibit stronger clustering (see figure (|T7|). It has been 
suggested that a good fit to observations is provided by the two-point corre- 
lation function £(r) = (0.4<ij/r) L8 where <ij is the mean separation of objects 
belonging to a given richness class [|T^,0. Furthermore, a cross-correlation 
analysis between clusters and galaxies in the Abell and Lick catalogues re- 
spectively, gives £ cs = (r cg /r) 2 with correlation length r cg = 15 ± 3/i _1 Mpc 



The increased amplitude of cluster clustering compared with galaxy clustering, 
and Kaiser's interpretation of this fact ||286|| , led to the work on the peaks 
model discussed in 53. 



8.3. Higher-order correlations 



The two-point correlation function is a useful statistic, but it only affords a 
complete description of the cosmological density field if galaxies constitute 
a Poisson sample of a Gaussian random field. To obtain a more complete 
statistical description we need to include probabilities of finding three objects 
in three disjoint volumes, and so on. For a set of n objects we need to know 
the m-point correlation functions for 2 < m < n in order to have a full 
description. 

The definition of the higher-order correlation functions is a straightforward 
generalisation of equation 



dP m = n 



in 



1 + C (m) (r)l dVi... dV m , (293) 



The function ^ m ^(r), however, contains contributions from correlations of or- 
ders less than m. For example, the number of triplets is larger than a random 
distribution partly because there are more pairs than in a random distribution: 

dP 3 = n 3 [1 + £12 + 6s + 6i + Cm] • (294) 

The part of ^ which does not depend on usually written £ 12 3, is called 
the irreducible or connected three-point function. The four-point correlation 
function ^ will contain terms in Qjk, iij^ki and £y which must be subtracted 
to give the connected function 771234- An equivalent term to describe the con- 
nected moment of order m is the cumulant, K m . Since one is generally most 
interested in the connected n-point functions, the cumulants are the most 
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Fig. 17. The amplitude of the two-point correlation function (scaled to 1 Mpc.) 
between objects belonging to different "richness classes" is shown as a function of 
their mean separation di- Reproduced, with permission, from [18]. 



natural functions to use in many applications. 

To construct higher-order cumulants of a continuous field one makes judicious 
use of the cluster expansion [^3| . The cumulants defined as the part of 



the expectation value 5\ . . . 5 n which cannot be expressed in terms of expec- 
tation values of lower order; 8\ = S(xi), etc. To determine the cumulant in 
terms of (6iS 2 ■ ■ ■ 5 n ) for any order one simply expresses the required expecta- 
tion value as a sum over partitions of the set {1, . . . , n}; the cumulant is just 
the part corresponding to the unpartitioned set J43|. For example, the possi- 
ble partitions of the set {1,2,3} are ({1},{2,3}), ({2}, {1,3}), ({3}, {1,2}), 
({1}, {2}, {3}) and the unpartitioned set ({1,2,3}). This means that the ex- 
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pectation value can be written 



< Ws) = (5i) c (5 2 5 3 ) c + (5 2 > C (M 3 > C + (5 3 >c(<W c + (6i) c {6 2 ) c {5 3 ) c + < W 3 ) c 

(295) 

The cumulant k 3 = (5i5 2 5 3 ) c , k 2 = (5i5 2 ) c , etc. Since (5) = by construction, 
K i = (<^i)c = = ||198|| . Moreover, n 2 = (8182)0 = (^1^2)- This definition 
can be extended to higher order quite straightforwardly. All the cumulants for 
n > 2 are zero for a Gaussian random field; for such a field the odd expectation 
values are all zero, and the even ones can be expressed as combinations of (5i5j) 
in such a way that the connected part is zero. 

One can define the cumulants corresponding to a discrete set of points in 
an analogous manner. Equation (|293|) gives the the probability of finding m 
galaxies in the m (disjoint) volumes 5Vi in terms of the total m-point corre- 
lation function £( m ). This function, however, also contains contributions from 
correlations of lower order than m and a more useful statistic is the reduced or 
connected correlation function which is simply that part of £( m ) which does not 
depend on correlations of lower-order; we shall use K m for the connected part 
of £( m >. One can illustrate the way to extract the reduced correlation function 
simply using the three-point function as an example. Using the cluster expan- 
sion in the form given in the preceding paragraph but, this time, assigning the 
the single partitions (5i) the value of unity for point distributions rather than 
the zero value one uses in the case for continuous fields [[200|| , we find 

5 3 P 3 = n 3 v [l + £(r 12 ) + £(r 23 ) + £(r 31 ) + C(r i2 , r 23 , r^SVMSV^ (296) 



where ( = k 3 is the reduced three-point function or, equivalently, the discrete 
cumulant. The terms £(7"jj) represent the excess number of triplets one gets 
compared with a random distribution (described by the '1') just by virtue of 
having more pairs than in a random distribution; the term ( is the number of 
triplets above that expected for a distribution with a given two-point correla- 
tion function. From now on we shall drop the term "connected" or "reduced" ; 
whenever we use an m-point correlation function, it will be assumed to be the 
reduced one. 

As with the two-point function, the higher-order covariances of a continuous 
density field have the same form as the higher-order correlations of a spa- 
tial point process if the point process is generated from the continuous field 
according to the Poisson model. In other cases this need not be so. The esti- 
mation methods and error analysis can be generalised to these functions from 
the two-point case ||440| . 



Observationally, it seems that the three-point cumulant is fairly well described 
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by the hierarchical form [449 , ^3^] : 



« 3 = ^(612623 + 6361 + 6162)- (297) 
with Q = 1.0 ± 0.1 p7piy5g2|P71P7|l . Objections to this claim have also 



been lodged |511| , |7"3 ,72|. This observation is the motivation for the Davis- 



Peebles closure ansatz for the BBGKY hierarchy; see §6.1. 



The generalisation of eq. ( |297|) to n > 3 involves a bit of combinatorial analysis: 

9 I (n-1) 

The notation here means a product over the n — 1 edges linking n objects, 
summed over all relabellings of the objects (I) and summed again over all 
distinct n-tree graphs with a given topology g weighted by a coefficient Q n , g - 
The four point term must therefore include two coefficients, one for 'snake' 
connections and the other for 'star' graphs. This is illustrated in Figure 



Attempts have been made to extract the hierachical coefficients for all n < 9 
209| , but this is certainly a laborious way to describe the galaxy distribu- 



tion. Rather than look at the behaviour of the moments separately, it might 
be better in general to construct the full distribution function of cell-counts 
and compare that with theory. Furthermore, because of the finite number of 
galaxies in any sample, it becomes very difficult to measure the correlation 
functions on large scales as n increases. 

Although the hierarchical model appears successful at fitting the observed 
correlations, one should be cautious about whether this does actually mean 
that the Davis-Peebles form of self-similar evolution is at work. For a start, 
such a hierarchy also developes to leading order in perturbation theory f4~3| , |3"4 



Also, one must bear in mind that other models for k 3 might be difficult to 
distingish on large scales from eq. fl297| ). For example, the lognormal model 
produces three-point correlations of the Kirkwood ||307|| form: 



^3 = (61263 + 6361 + 63162) + 61263631, (299) 

so that the hierarchical coefficient is Q, but there is an additional triplet term 
which does not fit the hierachical form. Notice, however, that this triplet term 
will be less than the hierarchical terms for small £ and the model therefore 
looks hierachical on large scales. A discussion of this problem in the context 



of higher-order cluster correlations is given in [p.47 ]. 
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Fig. 18. In the hierarchical model, different topologies of graphs connecting the 
n-points must be assigned a different weight. For n = 2 and n = 3, the different 
graphs connecting the points are topologically equivalent, but for n = 4 there are 
two distinct topologies. The topological difference can be seen by considering the 
result of cutting one edge in the graph. The first 'snake' topology is such that 
connections can be cut to leave either two pairs, or one pair and a triplet. The 
second cannot be cut in such a way as to leave two pairs; this is a 'star' topology. 
There are 12 possible relabellings of the snake and 4 of the star. For the n = 5 
function, there are three distinct topologies, illustrated in the figure with 5, 60 and 
60 relabellings. We leave it as an exercise for the reader to show that n = 6 has 6 
different topologies, and a total of 1296 different relabellings. 

8.4- Power Spectra 

There are many advantages, particularly on large scales, in not measuring 
the two-point correlation function directly, but through its Fourier transform. 
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The Wiener-Khintchin theorem states that, for a statistically homogeneous 
random field, the two-point correlation function is the Fourier transform of 
the power-spectrum (§2.4): 

= j d 3 kexp(ik-x)P(k) 

P(k) = / d 3 xexp(-ik-x)£(x). (300) 

The power-spectrum is, roughly speaking, proportional to the mean square 
amplitude of Fourier modes of the distribution. For power law primordial 
spectra with a short range cut-off: P(k) oc k n k < k c , P(k) = k > k c , 
£(x) oc £-( 3+n ) (n > —3), x » fc" 1 ; which can be used to deduce the power 
spectrum from a knowledge of £ in regions where £ can be represented as a 
power law. Being actually a spectral density function, however, P(k) must have 
units of volume [43| . This simple fact seems to have caused much confusion in 



the literature, and many authors insist on inserting unnecessary factors of the 
sample volume V into equation fl300| ) to account for this [|440|j430|| . To avoid 



this possible dependence upon the sample volume it is more useful to deal with 
a dimensionless power-spectrum A 2 (k) = 47rk 3 P(k). (Still, one must beware 
of different conventions with the factors of 2ir in the definition of the Fourier 
transform.) 

Studies based on the clustering of optical and IRAS galaxies seem to indicate 
that the observations can be described by a power spectrum which follows 
an approximate power-law form with a slope which ranges from n ~ —2 
on scales A < 3011" 1 Mpc to n ~ —1 on scales 30 < A < 50 — 100 h 
Mpc [^ CTj55T| , [T9^ , |191| , t426iPl . On very large scales A > 700 Mpc. the 



shape of the power spectrum is determined by the angular fluctuations in 



the Microwave Background Radiation which indicate n = 1.1 ±0.32 [ pi7]j563 
The observed power spectrum is compared with the standard cold dark matter 
power spectrum in figure (P?l). 

Peacock and collaborators have explored the power-spectrum of galaxy clus- 
tering for a number of different samples | |434| , f429| , [430| , f431| | and the results are 
reasonably well fitted by the functional form: 

A2 « = TTW^ ' (301) 

which describes a break between two power laws. The best-fitting value for 
the parameters are a = 1.50 ±0.03, (3 = 4.0 ±0.5, k c ~ 0.039 ±0.002/i Mpc" 1 , 
k ~ 0.29 ± 0.01/i Mpc -1 , but k depends quite sensitively upon the accuracy 
of the various selection functions. (The value of j3 is related to the primordial 
spectral index on large scales, (3 = 4 corresponds to the Harrison-Zel'dovich 
scale- invariant spectrum.) The form ( |301| ) on large scales, is similar to a low- 
density CDM spectrum or a mixed CHDM spectrum ||171| , |420|| . The effect of 
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Fig. 19. The power spectrum of matter reconstructed from observations of the 
Cosmic Microwave Background and from galaxy clustering data. The boxes on 
the left refer to MBR measurements conducted with (from left to right) COBE, 
FIRS, Tenerife, SP91-13pt, Saskatoon, Python, ARGO, MSAM2, MAX-GUM and 
MAX-MuP, MSAM3. The data points correspond to the following large scale sur- 
veys: stars (APM), crosses (CfA), triangles (IRAS-1.2Jy), squares (IRAS-QDOT). 
The solid line corresponds to the CDM spectrum normalised to COBE. The MBR 
power spectrum (for Qb = 0.06) is shown at the bottom. Reproduced, with permis- 
sion, from |563| 1 . 



non-linear evolution on small scales (high k) is discussed by Peacock & Dodds 
||431|| using the ansatz of Hamilton et al |[247|| , which is discussed in §6. The 
power spectrum of Abell cluster correlations has also been computed ||435 



the results are consistent with a rather large value for the correlation length, 
r — 21/i~ 1 Mpc and indicate that the clustering strength does depend on the 
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cluster richness. Estimates of the value of the spectral index on cluster scales 
indicate -1 < n < -2, p5l|E[9"l . The redshift -space anisotropy of P(k) can 
also be used to estimate Qq by analogy with the two-point correlation function 



291 


226 


194 


112 


255 



Since the power spectrum is the Fourier transform of the two-point correlation 
function, it would seem likely that similar transforms of the n-point correlation 
functions would also prove to be useful descriptors of galaxy clustering. This 
is indeed the case. For example, the Fourier transform of the three-point 
correlation function is known as the bispectrum | |440| , |205| , |3"2[ . The use of higher- 
order spectra is not widespread, but they may turn out to be a very effective 
way to detect non-Gaussian fluctuation statistics on very large scales. 



8.5. Cell-counts 



A simple but useful way of measuring the correlations of galaxies on large 
scales which does not suffer from the problems of the correlation functions 
is by looking at the distribution of galaxy counts in cells, Pn(V) (defined 
as the probability of finding exactly iV objects in a randomly placed volume 
V), or the low-order moments of this distribution such as the skewness 7 
and variance a 2 . Indeed some of the earliest quantitative analyses of galaxy 
clustering adopted this approach |265| , |582|,|51U| ]. More recently, observations of 
the spatial distribution of IRAS galaxies indicate that a = 0.5 ± 0.1 in a box 
of size (30/1- 1 Mpcf p7 ]. 



The counts in cells probability Pn(V) is the discrete analogue of the proba- 
bility distribution function (PDF) P(5) whose moments are defined as 

<5 n >= J 5 n P(5)d5 (302) 
-1 

For a discrete distribution of point sources this generalises to 

s <(tt) > = &(—) Pk(v) (303) 

where \ii = °~ 2 -, ^3 = 7!^ is the mean number of galaxies in a cell of volume V. 
Models such as the lognormal (|133|) and the gravitational quasi-equilibrium 



distribution ( 263 ) allow one to test the whole distribution of cell counts against 
the theory. Both the Poisson-sampled lognormal and the thermodynamic dis- 
tribution seem to fit the observations fairly well. 

Using only the moments of the cell-count distribution does result in a loss of 
information, but the advantage is a simple relationship between the moments 



131 



and the K n , e.g.: 

.W^((^) 2 > = i + , 2 (304) 

where 

R ri = T7Z \ ■■■ \ dVi . . . dV n K n (ri . . . r n ). (305) 



Apart from the Poisson term - which is a discreteness effect - the second 
order moment is simply an integral over the two-point correlation function. 
The same is true for higher order moments, but the discreteness terms are 
more complicated and the integrals must be taken over the cumulants. For 
example, the skewness 7 can be written 

, s ,/AiV\ 3 x 2 3a 2 1 3R 2 , , 

In terms of 7, our hierarchical parameter S3 is just 7/a 4 (S3 ~ for 
large N) ||126|| . Equation ( 304 ) provides a good way to measure the two-point 
correlation function on large scales and its use is discussed in [ |177|J497| |. Use 



of the skewness as a descriptor is discussed in [p.26| . [7^ . p83|j86| . 



The usual formulation is to write the ratio of the n-th order moment to the 
(n — l)-th power of the variance as S n . A moment's thought will then show 
that for a hierarchical model, the S n should be constant, independent of cell 
volume. For the simply hierachical distribution ( |297| ) we have that S3 = 3Q, 
which seems to be in reasonable agreement with measured skewnesses [p.26f| . Of 



course, there should be some scale dependence of clustering properties if the 
initial power spectrum is not completely scale-free, so one would not expect S3 
to be accurately constant on all scales in, for example, the CDM model. It is, 
however, a very slowly-varying quantity. It should be stressed that the higher 
order R n become increasingly sensitive to the presence of rare peaks of high 
density in a distribution and one should expect substantial errors in determi- 
nations of them from samples of relatively small size. Applications of the count 
probability approach to galaxy catalogues are dicussed in ||177y3471 , [208| , |80| . |210 



the use of count probabilities in the analysis of N-body simulations may be 
found in ^J77| J^7| , ggT] , gU7| , ggg| . Q 

As noted in §4.1, higher order cumulants are related to lower order cumulants 
in the weakly non-linear regime, by the correlation hierarchy R n = S n R2~ l ,n > 
1 [p4] , [440|jr98| , |213|| . As discussed above and in §8.3, such a scaling law can also 



be predicted by the BBGKY equations in the fully nonlinear regime. It is 
interesting that the observed values of S3 ~ R3/R2 and S4 ~ R^jR\ do appear 
to be approximately constant over a wide range of scales. (Studies of the 
IRAS sample indicate S\ RAS ~ 1.5 ± 0.5 on scales ranging from 1 to 25 h _1 
Mpc |80|| ). Thus within the considerable errors, there seems to be a roughly 

32 The role of the skewness in analysing the Cosmic Microwave Background is discussed in 
[f352|j35ltp4l. 
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hierarchical behaviour of the clustering data [126 , 7^ , 283 , 80] consistent with 
most gravitational instability models of structure formation. 



One might be tempted to use the observed value of S° BS in conjunction 
with the quasi-linear theory prediction: S3 = (8 3 ) c /(5 2 ) 2 = 34/7 — (n + 3) 
to determine the value of the spectral index n. However the fact that light 
need not trace mass makes this difficult in practice: the observed value S° BS 
and S3 are related by S° BS = S3/6, where b = 5 OBS /5 is the linear biasing 
factor. Thus, we need an independent estimate of n in order to determine b 
and vice-versa. A bias-independent means of determining the spectral index 
might be provided by the skewness and kurtosis of velocity fields which are 
discussed in §8.10. 

It is interesting to note that many different approaches to the clustering prob- 
lem agree with each other as far as the relatively low-order moments of the 
cell-count distribution. As we saw in §4, different dynamical approximations 
agree with perturbation theory in predicting hierarchical behaviour in the 
skewness. Moreover, the lognormal model (§3.5) also produces, in the limit of 
large scales, a value of S3 = 3. Although this is actually a Kirkwood model it 
does behave on large scales very much like an hierarchical model. A completely 
different approach using the gravitational quasi-equilibrium distribution of 
§6.2, also gives a constant S 3 = (1 + 2b). For values of b around 0.65 — 0.75, 
this also agrees roughly with the observations. Although it is encouraging that 
these different approximations do agree with each other to a reasonable degree 
and also seem to behave in roughly the same way as the data, it is advisable to 
be cautious here. The skewness is a relatively crude statistical descriptor and 
many different non-Gaussian distributions have the same skewness, but very 
different higher-order moments. One could proceed by measuring higher and 
higher-order moments from the data, but we do not feel that this is a very 
efficient way to proceed. It is perhaps better to focus instead upon the distribu- 
tion function of cell-counts, Pn{V), rather than its moments. The problem is 
that, except for a few special cases, it is not possible to derive the distribution 
function analytically even in the limit of large V. The observed form of P^iV) 
may be fairly well matched by the gravitational quasi-equilibrium distribu- 
tion (though see J79[), and also by the Poisson-Lognormal model [ |1271 . |103|| ; 
see §3.5. 

The distribution function of galaxy counts can be related to the moments (or 
integrals over the correlation functions) by the moment generating function; 
see |pOO| . It should be pointed out, however, that the moment generating 
function does not necessarily exist for all statistical distributions: the necessary 
series does not converge, for example, for the lognormal distribution |127 



For hierarchical cumulants, equation (|298|) , the series does converge; scaling 



properties of the observed Pn(V) do seem to be in accord with the hierarchical 
model ||378| , p79| , p04jj20|j550| , but scaling seems to be broken on large scales. 
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8. 6. Voids 



The distribution function of galaxy counts leads naturally on to the Void Prob- 
ablity Function (VPF) - the probability that a randomly selected volume V is 
completely empty. Properties of voids are also appealing for intuitive reasons: 
these are the features that stand out most strikingly in the visual appearance 
of the galaxy distribution. As demonstrated in [ |564 !, [Zm the generating func- 



tion of the count probabilities V(X) = J2n=qX n Pn(V) can be shown to be a 
sum over the mean value of cumulants of all orders 



logP(A) = £ 



(A-l)» 



n=l 



111 



{N) n R r , 



(307) 



Setting A = in eq. (|307|) we obtain 

iogP 00 = E 



-N) 7 



-K r 



n=l 



TV. 



(30£ 



again as long as this sum converges [ 127|| . In an unclustered Universe, k± = 1 
(because this is a discrete distribution: §8.3) and R n = for n > 2 and 



P (V)=exp(-N), P N (V) 



(N) 



N 



exp(-iV) 



(309) 



which is just the Poisson distribution. In the weakly nonlinear regime equation 
( |307|) can be rewritten as |34| 



logP(A) 



ip[(l-\)Na 2 



a" 



(310) 



where ip(y) is the generating function for the moments S p 



v{y) = J2 S p 

P =i 

the resulting VPF has the form 

P (V) = exp 

where y = Na 2 . 



pi 



(311) 



—N 



<p(y) 



(312) 



The VPF is quite easy to extract from simulations and real data and turns 
out to depend strongly upon correlations of all orders and is therefore a poten- 
tially useful diagnostic of the clustering: many authors have investigated its 
properties pD^CT^ , pg|P^ , |^ , p01 , ^ , |i^j4^ , |3^ . These studies again 
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seem to support the view that clustering on scales immediately accessible to 
observations is roughly hierachical in form. 



Although the VPF is unquestionably a useful statistic, it pays no attention 
to the geometry of the voids, or their topology. Typically one uses a spherical 
test volume so a flat or filamentary void will will not register in the VPF with 
a V corresponding to its real volume. Moreover, because the voids which seem 
most obvious to the eye are not actually completely empty, these do not get 
counted in the VPF statistics. It would be good to have an alternative statistic 
for measuring void properties. One possibility is the void spectrum discussed 
in ||299| , |301| , [476| 1 and §5.3, which seems to be a good statistical discriminator 



between models of structure formation. 



8.7. Geometrical Descriptors 



The possibility that galaxies lie preferentially along filamentary or sheet- 
like structures separating large voids has aroused considerable interest during 
the last decade ||277| , |159| , |160| , |555|| . Because of the possible connection between 



these structures and the classical Zel'dovich pancake scenario, strenuous at- 
tempts have been made to find objective methods for quantifying the geomet- 
rical properties of galaxy clustering. (Bearing in mind the 'Canals of Mars' 
fiasco, one should also be aware of the problem that one's eye is easily misled 
into constructing filaments and such structures from images where they do 
not really exist fl25|.) 



One of the first methods suggested was based on percolation theory and was 
borrowed from condensed matter physics ||581|j576| , |505| , |577| l. To illustrate the 



method, suppose we have n galaxies in a sample cube of side L; the mean 
separation of galaxies is defined to be / = Ln~ 1 ^. Around each galaxy we 
construct a sphere of diameter d = bl, where b is a dimensionless quantity. If b 
is small, virtually all the spheres will be disjoint; as it increases, neighbouring 
spheres will overlap in clusters forming percolated regions while isolated spheres 
will still be disjoint. At some critical value b*, the largest percolated region 
traverses the whole volume: L p (b) = L and b = b*, where L p is the size of 
the largest percolated region. The value of b depends on the local geometry 
of the distribution. For a regular lattice, 6 = 1 and for a random (Poisson) 
distribution the most probable b* is b* = b* = 0.86. A distribution containing 
filaments and sheets would percolate more easily than this and would have 
b < 0.86; a system characterised by isolated dense clumps would have b > 0.86. 
For discussion of this method, including sampling problems and more detailed 
percolation statistics, see |7^g05] ,^g, g5g , g7gj3T2| , g09[ . 



There is some degree of consensus in the literature that the percolation method 
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is not very successful at discriminating between the kind of data and theo- 
retical models involved in cosmology. For example, HDM and CDM models 
which both have b* ~ 0.65. To some extent, however, this conclusion refers 
not so much to the percolation method itself, but to the attempt to encode all 
the properties of the percolation process in one single parameter b*. Since the 
critical percolation threshold is determined by the size of the largest structures 
in the simulation box, one might have expected this not to be a very sensi- 
tive discriminant. A more detailed analysis might well prove more successful. 
For example, consider what happens as the percolation radius is increased. In 
the beginning, one has a large number of isolated clusters. As the radius is 
increased, these tend to merge together producing a smaller number of larger 
clusters some of which may be highly elongated. Eventually, when percolation 
occurs, one cluster is the size of the simulation box, but there may be other 
clusters remaining which are smaller than this biggest one. In a refinement of 
the percolation method Klypin & Shandarin ||312|| have shown that two pa- 



rameters are very sensitive diagnostics of the way percolation proceeds: the 
size of the largest cluster (as a function of b) and the mean cluster size (exclud- 
ing the largest one). Instead of using one number to characterise the onset of 
percolation, one therefore uses two functions which describe different aspects 
of the process. It remains to be seen whether this can be a useful technique 
in practice. 

Various statistics have been suggested to search for specifically filamentary 
signatures in the galaxy distribution. Kuhn and Uson [|325|| suggested one 
should use the mean misalignment angle between consecutive steps of a self- 
avoiding random-walk through the distribution. It seems that the effect of 
Poisson noise and sample errors is to easily de-rail the walk from a filament 
so the method is of limited usefulness. A similar problem besets the ridge- 



finding algorithm [397] : just one spurious galaxy position can alter the results 
markedly, so the algorithm is insufficiently robust for realistic data sets. A 
more promising approach is to use the quadrupole moment of the galaxy 



distributions |20Q| , |548|| , or structure functions derived from the moment of 



inertia tensor of the distribution |TjJ which are described below. 

Consider a set of particles located within a radius Rk of a randomly chosen 
k th particle. Let (a = 1, 2, 3) denote the position vector of the j th particle 
relative to the k th particle in this ensemble. One can construct the first and 
second "moments" of the distribution of particles located within a distance R 
of the k th particle: 

M k(R) = W Y,*i> (313) 

iVfc j=1 

Mf{R) = W Y,^y (314) 

JVfc j = 1 

One can then define the local moment of inertia tensor about the centre of 
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mass of each particle by 

lf{R) = Mf{R) - MS(R)Ml(R) 

and its average 



1 N 



(315) 



(316) 



k=l 



where N denotes the number of centers used in carrying out the average. 

Diagonalising I a/3 (R) we determine its eigenvalues Ii,i2>^3 (arranged in de- 
creasing order of magnitude) and the ratio's 



The structure functions S±, S2, S3 are then defined as follows JT3 

Si = sin 



-(1 



S 2 



sin 



|a(/z, v) 



a • / "A* 
63 = sin — 



2 

where p = log 3/ log 1.5, a(/x, v) is given by solving the equation 



a 2 a 2 {l - aa 1 / 3 + pa 2 /*) 
the parameters a and f3 are 

a = 1.9; 
(3 = -Q 9 1 / 3 + aS 1 / 3 . 

These somewhat involved definitions ensure that < Si < 1 for all i. Thus 
(Si, S%, S 3 ) = (0,0,1) for a sphere; (Si, S2, S3) = (0,1,0) for a flat sheet; 
(Si, S2, S3) = (1,0,0) for a filament. Intermediate values of Si indicate the 
departure of a structure from its eikonal shape: sphere, pancake or filament. 



(317) 

(318) 

(319) 
(320) 

(321) 

(322) 
(323) 



For details see 13 



A related measure, also constructed out of the moment of inertia tensor is the 
Vishniac quadrupole statistic [p00|,|548| 



Q k (R) 



k J 



and 



N 



Q(R) = JjEQk( R )i 
k=l 



(324) 



(325) 
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where N is the number of centres chosen in carrying out the average. The 
statistic Q(R) always lies in the range [0,1]. It attains its maximum value 
Q = 1 for a straight line passing through the centre, and is zero for a uniform 
distribution of points. A comparison of Q and the structure functions Si, S 2 , S 3 
is discussed in 1 436|| . Applications of Q to study the growth of filamentarity 



during gravitational clustering are discussed in [[412,4931. 



These techniques, although robust, can still be confused at low signal-to- noise 
levels [|412| , |493|| . One way to improve their performance might be to enhance 



the filamentary pattern before trying to quantify its existence with some nu- 
merical statistic. This might be accomplished by determining Si and Q for 
regions of a specified overdensity [f494|| . Another way would be to preprocess 
the data by constructing the minimal spanning tree (MST) of the set of points 
[436|| . The MST is a geometrical construction which owes its origins to graph 



theory. Consider a distribution of points (say, galaxies) in three-dimensional 
space. One can define a graph on this set to be a collection of nodes (galaxies) 
and edges (lines joining galaxies). The degree of a node is the number of edges 
which emerge from it; a sequence of edges joining nodes is called a path; a 
closed path is a circuit; if any two nodes within a graph have at least one path 
connecting them then such a graph is said to be connected. A connected graph 
which has no circuits is called a tree; if a tree contains all the nodes within a 
graph then it is a spanning tree. The minimal spanning tree of a data set is 
the tree whose total edge length is the smallest. (A tree is said to be unique 
if no two edge lengths within it are equal.) 

It is sometimes convenient to modify the MST of a data set in order to elim- 
inate accidental linkages which can be quite common if the data set is noisy. 
One way of doing this is by the operations of pruning and separating which 
are illustrated in Figure ( ^0[) and are carried out as follows: we define a k 
branch to be a path of k edges connecting a node of degree 1 to a node of 
degree exceeding 2, in such a way that all intervening nodes are of degree 2. A 
MST is said to be pruned to a level p if all fc-branches with k < p have been 
removed. Pruning serves to remove extraneous 'foliage' on the MST which is 
not directly related to the main structure of the pattern. One can also sep- 
arate a tree by removing any edges whose length exceeds some cut-off scale 
lo usually defined in terms of the mean length of edges in the MST < I >. 
Separation tends to minimise 'accidental' linkages, the tree after separation 
consists of several disjoint pieces which can be analysed further. 

An advantage of the MST is that it emphasises filaments embedded in back- 
ground noise thus enhancing the signal-to-noise ratio in any quantitative 
statistic such as the structure functions S±, S 2 , S 3 and Q | |436| | . A simple algo- 



rithm of constructing the MST, the so-called Greedy Algorithm, is described 
in 0. 
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Fig. 20. The Minimal Spanning Tree used to enhance structure in a clustered point 
set. The diagram (a) shows a diagrammatic set of points in two dimensions; (b) 
shows the MST of the set (a); (c) shows the result of pruning - all nodes of degree one 
connected to nodes with degree exceeding two have been removed; (d) a separated, 
pruned tree - all edges of (c) exceeding a certain critical length have been removed. 
Adapted, with permission, from [^]. 

8. 8. Topological Descriptors 

Interesting though the geometry of the galaxy distribution may be, such stud- 
ies do not tell us about the topology of clustering. Or, in other words, its 
connectivity. One is typically interested in the question of how the individual 
filaments, sheets and voids join up and intersect to form the global pattern. Is 
the pattern cellular, having isolated voids surrounded by high-density sheets, 
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or is it more like a sponge in which under- and over-dense regions interlock? 



Looking at 'slice' surveys gives the strong visual impression that we are dealing 
with bubbles; pencil beams perhaps re-inforce this impression by suggesting 
that a line-of-sight intersects at more-or-less regular intervals with walls of 
a cellular pattern. One must be careful of such impressions, however, because 
of elementary topology. Any closed curve in two dimensions must have an 
inside and an outside, so that a slice through a sponge-like distribution will 
appear to exhibit isolated voids just like a slice through a cellular pattern. It is 
important therefore that we quantify this kind of property using well-defined 
topological descriptors. 

The problem of quantifying the topology of large-scale structure has been dis- 
cussed in the literature for many years ||161| , |508| . More recently, in an influen- 



tial series of papers, Gott and collaborators |p20|,p46| , ^22| , |383|| have developed 



a particular method for doing just this. We shall not discuss this approach in 
much detail, because of the existence of an extensive review of this work by 
Melott |S3 . 



Briefly, the method makes use of a topological invariant known as the genus, 
related to the Euler-Poincare characteristic, of the iso-density surfaces of the 
distribution. To extract this from a sample, one must first smooth the galaxy 
distribution with a filter (usually a Gaussian is used) to remove the discrete 
nature of the distribution and produce a continuous density field. By defining 
a threshold level on the continuous field, one can construct excursion sets 
(sets where the field exceeds the threshold level) for various density levels. 
An excursion set will typically consist of a number of regions, some of which 
will be simply connected and others of which will be multiply connected. If 
the density threshold is labelled by v, the number of standard deviations of 
the density away from the mean, then one can construct a graph of the genus 
of the excursion sets at v as a function of v: we call this function G[y). The 
genus can be formally expressed as an integral over the intrinsic curvature K 
of the excursion set surfaces, S v by means of the Gauss - Bonnet theorem: 

4tt [1 - G(v)\ = J KdA, (326) 

where the integral is taken over each compact two-dimensional surface in the 
excursion set. Roughly speaking, the genus for a single surface is the number 
of handles the surface posesses; a sphere has no handles and has zero genus, 
a torus has one and therefore has a genus of one. For technical reasons to do 
with the effect of boundaries, it has become conventional not to use G but 



Gs = G — 1 | p22j| . In terms of this definition, multiply connected surfaces have 
Gs > and simply connected surfaces have Gs < 0. 

To compute topological properties of discrete data sets, it is usually convenient 
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Fig. 21. A polyhedral torus. This figure is adapted from [383]. 

to work with the discrete analogue of equation ( |326| ) which is easy to derive 
once we note that the curvature of a discrete surface is localised in each of its 
N vertices so that 

K{x) ~ D l 5(x - Xi) (327) 

where Di is the deficit angle evaluated at the i th vertex of a polyhedral surface. 
Substituting equation (|327|) in equation (|326|) we obtain 



N 



4tt[1-G(i/)] = X;A 



(328) 



which is the required analog of equation (|326|) . 

To see how equation (|328|) can be used to determine the genus of a compact 
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polyhedral surface, let us consider the simple but illustrative example of a torus 



shown in figure (21). All the curvature of the polyhedral torus is concentrated 
in each of its 16 vertices, the deficit angle at each vertex being Di = 2n — J2i Vi, 
where J2i Vi is the sum of the vertex angles around a given vertex. Thus, each 
of the outer eight corners of the torus contributes a positive deficit angle: 
Di = 2tt — 3tc/2 = it/2, whereas each of the eight inner corners of the torus 
will contribute a negative deficit angle Di = 2ir — 5n/2 = —tt/2. The sum 
total of all the deficit angles is therefore J2}li A = 8 x n/2 — 8 x tt/2 = 0, 
implying G = 1 for a torus. (A numerical algorithm for determining the genus 



of a two or three-dimensional discrete manifold is given in ||383|| 



One of the great advantages of using the genus measure to study large scale 
structure, aside from its robustness to errors in the sample, is that all Gaussian 



density fields have the same form |[L61| j6|, p4~6|| of Gs{v): 

(329) 



Gg(y) = A (l - v 2 ) exp (~j 



This means that if one smooths the field enough to remove the effect of non- 
linear displacements of galaxy positions, the genus curve should look Gaus- 
sian for any model evolved from Gaussian initial conditions, regardless of the 
form of the initial power-spectrum, which only enters through the normal- 
isation factor A in Eqn. ( 329 ). This makes it a potentially powerful test of 



non-Gaussian initial fluctuations, or models which involve non-gravitational 
physics on large scales to form large-scale structure. The observations sup- 
port the interpretation of Gaussian initial conditions, although the distribu- 
tion looks non-Gaussian on smaller scales. The nomenclature for the non- 
Gaussian distortion one sees is a 'meatball shift': non-linear clustering tends 
to produce an excess of high-density simply-connected regions, compared with 
the Gaussian curve. What one would expect to see in the standard picture of 
gravitational instability from Gaussian initial conditions, is therefore a 'meat- 
ball' topology when the smoothing scale is small, changing to a sponge as 
the smoothing scale is increased. This is indeed what seems to be seen in the 
observations; an example is shown in Figure ([22]). 

The smoothing required also poses a problem, however, because present red- 
shift surveys sample space only rather sparsely and one needs to smooth rather 
heavily to construct a continuous field in the first place. A smoothing on scales 
much larger than the scale at which correlations are significant will tend to 
produce a Gaussian distribution by virtue of the central limit theorem. The 
power of this method is therefore limited by the scale of smoothing required 
which, in turn, depends on the space-density of galaxies. This is a particular 



problem in a topological analysis of the QDOT redshift survey ||398| 



Topological information can also be obtained from two-dimensional data sets, 
whether these are simply projected galaxy positions on the sky (such as the 
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Fig. 22. The genus curve is shown for galaxies in the CfA, a small shift towards 
meatball-like topology is discernable. Reproduced, with permission, from [p22[| . 

Lick map, or the APM survey) or 'slices', such as the various CfA compila- 



tions ||159| . There are some subtleties, however. Firstly, as discussed above, 
two-dimensional information topology does not distinguish between 'sponge' 
and 'swiss-cheese' alternatives. Nevertheless, it is possible to assess whether, 
for example, the mean density level (v = 0) is dominated by underdense or 
overdense regions so that one can distinguish swiss-cheese and meatball al- 
ternatives to some extent. The topological quantity usually discussed in two- 
dimensional work is the Euler-Poincare characteristic T of the excursion sets, 
which is roughly speaking the number of disjoint regions minus the number 
of holes in such regions. This is analogous to the genus, but has the interest- 
ing property that it is an odd function of v for a two-dimensional Gaussian 
random field, unlike G[y) which is even. In fact the mean value of T per unit 
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area takes the form 



(r(z/)) = Buexp {-v 2 /2 



(330) 

where B is a constant which depends only on the (two-dimensional) power 
spectrum of the random field P,|118||. Notice that (r) < for v < and 



(T) > for v > 0. A curve shifted to the left with respect to this would 
be a meatball topology, and to the right would be a swiss-cheese (in the re- 
stricted sense described above). The most obviously useful application of this 
method is to look at projected catalogues, the main problem being that if 
the catalogue is very deep, each line of sight contains a superposition of many 
three-dimensional structures. This projection acts to suppress departures from 
Gaussian statistics by virtue of the central limit theorem [ 146|| . Nevertheless, 
useful information is obtainable from projected data because of the sheer size 
of the data sets available [ pl]j22T|JT3Tl , ^55| , |146t|l2^1 ; as is the case with three- 
dimensional studies, the analysis reveals a clear meatball shift which is what 
one expects in the gravitational instability picture. An analysis of the genus 
curve for topological defect toy models was given in |]86| , |223|| . An application 
of the two dimensional genus characteristic to analysing anisotropy in the 
Cosmic Microwave Background is discussed in [|118| . p23| . |519| . |351|| . More sophis- 
ticated generalised morphological measures of large scale structure topology 
and connectedness based on Minkowski functionals have been studied in ||381| . 



8.9. Dimensions and Scaling 



The obvious way to test the idea of multifractal scaling using observations 
is to attempt to extract estimates of the f{ot) spectrum or, equivalently, the 
generalised dimensions D q from the catalogue. Care must be taken when doing 
this, however, because the formal defintions given in Sec. 6.3 require one to 
take the limit of very small box-size. Obviously, when dealing with a point 
process one cannot do this because in this limit all boxes will be empty. One 
therefore looks for scaling over a finite range of box sizes. 

The standard way to extract information from catalogues is via partition func- 
tions. For a distribution which has multifractal scaling properties (see §6.3), 
the function Z(q, r), defined by 

1 N 

Z(q,r) = -Y / nr 1 cxr^\ (331) 
iV i=i 

where r is the size of the cell. By graphing Z against r one can extract r(q) 
for any q and this is related to D q by eq. (|270|) . The Z-partition function does 
not behave well, however, for q < 2 because the low values of q are dominated 
by low-density regions whose statistics are severely distorted by discreteness 
effects, as discussed above. For q < 2 one therefore uses an inverse function - 
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the ^-partition function, defined by 



1 N 

W(r,n) = -Y,ri(n)- T ocn 1 -*. (332) 

8=1 

Here r^(n) is the radius of the smallest sphere centred upon a point i which 
encloses n neighbours. 

Given these two estimators one can extract all the D q from a galaxy catalogue 
280| , |369| , p70| , p68|| . Although this technique is relatively new, the results are 



encouraging. The CfA survey shows clear multifractal scaling properties hav- 
ing, for example, D 2 = 1.3 and Dq = 2.1; a more recent analysis of the QDOT 
survey, which predominantly probes the linear regime, has also good scaling 
properties but quite different to those of CfA: D 2 = 2.77 and D = 2.90. 
Notice that in the monofractal models discussed by, for example, Pietronero 
452|J115| , [116|| all the D q would be identical. It remains to be seen whether 



these analyses will lead to an understanding of the dynamical origin of the 
multifractal scaling observed. 



8.10. Velocity Statistics 



Because peculiar velocity data is known only for a relatively small number 
of galaxies, statistical studies of galaxy motion have been restricted to quite 
simple descriptors, such as the net flow in a given volume ||354 |. For Gaussian 
primordial density fields one can construct useful probability distributions, 
such as 




exp 



3 / v 
2 W 



dv 



(333) 



which is the probability that velocity is observed between v\ and v 2 on a 
smoothing scale R in a randomly-selected volume: o~l is given byf 33 ] 



a 2 v {R) = 47t(# /) 2 J P(k)W 2 {kR) dk. 



(334) 



33 Comparison of a v with the mean square density fluctuation a [see (plj)] , shows that 
large scales are weighed more heavily in the former. For power law spectra P(k) cx k n , 
(Jy(R) oc R~( 1+n \ the observed value of a v (R) can therefore be used to place constraints 
on the value of the spectral index n. 
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where / ~ Q 0M . More information is contained in two-point correlation statis- 
tics of v(x). Firstly, the scalar velocity correlation function: 



oo 

2 



&(r) = (v(x) ■ v(x')) = A7f(H f) 2 / P(k) Jo (kr)dk , (335) 



where jo(x) is a spherical Bessel function of order zero and r = \x — x 
445jg8lTOg29|l . 



Attempts to move further to consider the properties of the velocity correlation 
tensor, which takes into account the direction of v and not just its magnitude, 
were developed in detail by Gorski |214| , |215| . There are a number of subtleties 
involved in velocity statistics that one does not encounter with spatial distri- 
butions so we shall point a couple out here. Let us assume that we have a 
continuous density field and its related velocity field v(x); that is to say, we 
know the velocity at every point x and the density is also defined at x. The 
relevant autocovariance function of v must be a tensor: 

(jfatffa)) = ¥ j (xx,x 2 ). (336) 

Using statistical homogeneity and isotropy, we can decompose the tensor \1/ 
into transverse and longitudinal parts in terms of scalar functions ip± and 

V ij (xi, x 2 ) = ^||(x 12 )nV + if>±{xw) (5 ij - nW') , (337) 

where 

n = (x\ — x 2 ) /x\2- (338) 
If u is any vector satisfying u • n = then 

V>|| = ((n • vi)(n ■ v 2 )) 

1>± = -v 2 )). (339) 

It can be shown that, in the linear regime, VxjJ=0 and there is a consequent 
relationship between the longitudinal and transverse functions: 

^|| = ^ H±(r)) (340) 

One can test whether the actual velocity field obeys this relationship using 
the appropriate covariance tensor. Of course, one needs good peculiar motion 



data and the statistics available at present are not particularly good ||215 
One also only measures a peculiar motion in the radial direction from the 
observer. There is, however, another problem which is spelt out at length by 



Bertschinger [[E|: when one measures galaxy velocities one is not extracting 
a continuous velocity field. This has a complex and poorly-understood effect 
upon the final statistics. 
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A more sophisticated approach to the use of velocity information is provided 
by the POTENT analysis method [44,154, 15511 ; for a recent extensive review 
of this approach see [|153|| . The technique makes use of the fact that in the 
linear theory of gravitational instability the velocity field is curl-free and can 
therefore be expressed as the gradient of a potential. We saw in §2.2 Eqn. (|30| ) 
that this velocity potential turns out to be simply proportional to the linear 
theory value of the gravitational potential. Because the velocity field is the 
gradient of a potential $, one can use the purely radial motions revealed by 
redshift and distance information to map $ in three dimensions: 



A$(r,0,. 



v r (r , 6, 4>) dr. 



(341) 



Once the potential has been mapped, one can recover the remaining compo- 
nents of v a by differentiating $ along transverse directions. One can now solve 
for the density field using the linearised continuity equation 



6 



1 



afH 



V ■ v 



(342) 



or its quasi-linear generalisation |B3] 



3afH 



1 3/2 



V • v 



(343) 



Since the velocity field probes the underlying distribution of both dark and 
luminous matter, the POTENT approximation provides a potentially powerful 
means for determining Q. An extension of this techique using the Zel'dovich 
approximation instead of linear theory is discussed in | [415| ] (see §4.2). Great 
things are claimed of the POTENT analysis, but one should be cautious of 
these until larger and more reliable peculiar velocity data sets are available. 



From the continuity equation, eq. (|T9| ) or fl2~T|) one sees that the velocity field 
is more sensitive to the presence of longer wavelengths in the primordial spec- 
trum than the density field. We therefore expect the velocity field to show 
smaller departures from linear behaviour than the density field when both are 
smoothed over identical scales. It is therefore useful to extend the quasilinear 
analysis of §4.1 and §4.7 to velocity fields. We define for this purpose a scalar 
velocity variable 8(x) = (l/H)Vv(x), in terms of which the skewness and the 
kurtosis of the velocity field have the form 



T 3 
T 4 



9 2 ) 2 

34 \ 



D2\2 



Q2\3 



(344) 
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(the angular brackets denote an ensemble average). For Gaussian fields, T 3 = 
T 4 = 0. In the weakly nonlinear regime, gravity induces a departure from the 
initial Gaussianity which results in the following expressions for T 3 , T 4 , for 



scale free initial spectra P(k) oc k n with top-hat final smoothing ||283| , |36"l , f403" 



T 3 
T 4 



n 



1.2 



26 

y ' 

12085 
441 



(n + 3) 
338 



7 



— (n + 3) + -(n + 3) 2 



(345) 



If n is known then equation's ( |345| ) can be used to determine the value of Q. 
On the other hand, the ratio T 4 /T 3 2 is virtually independent of Q : 



T 4 _ (12088/441) - (338/21)(n + 3) + (7/3)(n + 3f 
7?" [(26/7) -(n + 3)] 2 



(346) 



and can therefore be used to test the gravitational instability hypothesis if 
the value of n is known independently [j36|1 . For instance galaxy survey's seem 
to indicate — 2 < n < — 1 on cluster scales (see §8.4), which leads to the 
prediction: 1.55 < T 4 /T 3 2 < 1.85, for data smoothed on such scales. 

These above methods of course require one to know peculiar motions for a 
sample of galaxies. As we have mentioned previously, one can still use the 
existence of peculiar velocities in a statistical sense even if one cannot mea- 
sure a velocity for each individual source. Correlations and power-spectra in 
particular show a characteristic distortion when they are viewed in redshift 
space rather than real space. The two-point correlation function will, for ex- 
ample, be elongated in the radial direction. Given the paucity of available 
peculiar velocity data, it seems to us that this type of analysis is the most 
promising approach to the use of cosmological velocity information at present 



9. Discussion and Conclusions 



In this review we have attempted to give a comprehensive account of the many 
facets of gravitational instability, focussing particularly upon its dynamical 
and statistical aspects. 

It is well established that gravitational instability proceeds along several dis- 
tinct epochs over which its characteristic features change continuously. It is 
perhaps helpful, by way of a summary, to delineate these epochs and how they 
can be described by the various techniques we have reviewed in this paper. 

During the first epoch density fluctuations are much smaller than unity and 
consequently an adequate description is provided by linear theory. During this 
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epoch fluctuations, as long as the thermal velocity of matter is small, evolve 
self-similarly 5(x, t) oc A + (t)5(x), as a result of which the fluctuation spectrum 
retains many of its original features, particularly the presence or absence of 
phase correlations. 

Linear theory begins to break down when 8 ~ 1, marking the onset of the 
quasi-linear, or weakly non-linear, epoch. During this epoch, mode-mode cou- 
pling between the different Fourier components of the density contrast be- 
comes important, resulting in the build-up of phase correlations. Non-Gaussian 
features in the density field then become important, even when the initial fluc- 
tuations are Gaussian. 

Our understanding of the strongly non-linear regime has undergone a radical 
change during the last two decades. Two decades ago the top-down model of 
galaxy formation suggested by Zeldovich and co-workers in connection with 
adiabatic models of structure formation and the bottom-up, or hierarchical, 
model suggested by Peebles in relation to isothermal models of galaxy for- 
mation were regarded as being mutually exclusive. The former produced cor- 
related structures on very large scales whereas clustering in the latter was 
essentially featureless and self-similar. The first hints that this rigid demarca- 
tion of the two scenarios was not, in fact, correct appeared in the early eighties 
in studies involving non-baryonic models (such as CDM). These models deve- 
lope structures from small to large scales and so might be called hierarchical 
and yet displayed many large-scale features usually associated with pancake 
models. These results received further confirmation from high resolution N- 
body simulations and non-linear approximations such as the adhesion model. 
It is now widely believed that pancakes form generically in models of hierar- 
chical clustering, the precise size of the pancake being determined by the form 
of the primordial fluctuation spectrum. 

It is interesting that pancakes, though prominent at the time of appearance, 
are actually best regarded as intermediate asymptotes which fade out and 
ultimately disappear as matter within them moves towards filaments (which 
form at the intersections of pancakes) and then along filaments towards clumps 
(which form where filaments intersect). The period during which pancakes, 
filaments and clumps coexist may be called the cellular epoch. Once most of the 
matter has collected in clumps, gravitational instability proceeds hierarchically 
as neighboring clumps attract one another, finally merging to form more and 
more massive aggregates of gravitationally bound matter. This later epoch 
of gravitational instability is characterised by the gradual flow of power from 
smaller to larger scales accompanied by the elimination of small voids and 
also, occasionally, by the emergence of large coherent structures - secondary 
pancakes - in models whose primordial spectra have signficant large scale 
power (including CDM). 
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Clearly the physics of the different epochs characterising gravitational insta- 
bility will vary continuously as will the statistical properties of the Universe 
during these separate epochs. It is here that non-linear approximations to 
gravitational dynamics accompanied by statistical methods play a vital role 
in the development of our understanding of these processes, and how they 
apply to the real Universe. 

While some features of gravitational instability, particularly its dynamical 
aspects, are reasonably well understood, others, including a comprehensive 
statistical treatment (such as the one based upon the BBGKY hierarchy) are 
still in the developmental stage. Consequently, some quasi-analytical treat- 
ments which can adequately address certain dynamical aspects of clustering 
may also can fail to come up with satisfactory answers to statistical questions 
and vice versa. An illustration of this is the Press- Schechter approximation 
which, while accounting fairly accurately for the mass (or multiplicity) func- 
tion of bound objects such as galaxies, fails to say anything at all about the 
large scale clustering properties of the distribution. 

It is becoming widely acknowledged that large scale features in the galaxy dis- 
tribution such as superclusters and voids are real physical entities and can no 
longer be regarded solely as products of a visual system sensitive to detecting 
filamentarity. As a result there is a growing need for statistical measures that 
can provide a qualitative description of such large scale features as exist in 
the galaxy distribution. It is also realised that whereas statistical descriptors 
such as the two point correlation function are very useful in characterising the 
clustering properties of the galaxy distribution on small scales, they say next 
to nothing about the over-all clustering pattern of galaxies in the Universe 
(whether bubble-like or sponge-like) and must therefore be complemented by 
other statistical measures some of which are discussed in §8. The development 
of analytical techniques to follow the evolution of structure must therefore be 
accompanied by a comprehensive understanding of the behaviour of the var- 
ious pattern diagnostics when applied to different degrees of evolution from 
different primordial power spectra. 

We have concentrated upon analytical approaches in this article, but there 
are of course many situations were an analytic treatment is not practical, 
particularly if one seeks a detailed comparison of evolved distributions with 
observational properties. We therefore fully acknowledge the enormous value 
of computational methods in this kind of study. Without the iV-body sim- 
ulation, progress in this area during the last two decades would have been 
limited indeed. Nevertheless, simulating the process of structure formation is 
not the same as understanding it and we feel most strongly that analytical 
studies should be the focus of much more theoretical effort in the forthcoming 
years. We have shown that much can be explained and understood using quite 
simple mathematical techiques, and that this is an extremely active area. As 
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further observational data are obtained, about the clustering of galaxies and 
the primordial fluctuations (from the cosmic microwave background), to com- 
plement these theoretical analyses, there is good reason to be optimistic that 
we will, before long, be able to understand the large-scale structure of the 
Universe. 



Acknowledgement 

The authors acknowledge stimulating conversations with Robert Branden- 
berger, John Dubinsky, Jim Fry, Nikolai Gnedin, Salman Habib, Alan Heav- 
ens, Anatoli Klypin, Lev Kofman, Bob Mann, Adrian Melott, Dipak Munshi, 
Dima Pogosyan, Dave Salopek, B.S. Sathyaprakash, Sergei Shandarin, Tarun 
Souradeep and Alexei Starobinsky. We also thank Ed Bertschinger for his de- 
tailed comments on an earlier version of this paper. Peter Coles receives a 
PPARC Advanced Research Fellowship and thanks PPARC for travel sup- 
port for the visit to IUCAA during which most of this paper was written. He 
also thanks IUCAA for hospitality. Both authors thank S.S. Pawar for help in 
preparing the figures. 



References 

[1] S.J. Aarseth, in: J. Brackbill and B.I. Cohen (eds), Methods of Computational 
Physics (Academic Press, New York, 1984) p. 1. 

[2] S.J. Aarseth, J.R. Gott and E.L. Turner, Astrophys. J., 236 (1979) 43. 

[3] L.F. Abbott and M.B. Wise, Nucl. Phys. B., 135 (1984) 279. 

[4] GO. Abell, Astrophys. J. SuppL, 3 (1958) 211. 

[5] GO. Abell, H.G. Corwin and R.P. Olowin, Astrophys. J. SuppL, 70 (1989) 1. 

[6] R.J. Adler, The Geometry of Random Fields (Wiley, New York, 1981). 

[7] J. Aitchison and J.A.C. Brown, The Lognormal Distribution with special 
reference to its uses in Economics (Cambridge University Press, Cambridge, 
1969). 

[8] L. Appel and B.J.T. Jones, Mon. Not. R. astr. Soc, 245 (1990) 522. 

[9] V.I. Arnol'd, S.F. Shandarin and Ya.B. Zel'dovich, Geophys. Astrophys. Fluid. 
Dyn., 20 (1982) 111. 

[10] V.I. Arnol'd, Catastrophe Theory (Springer, Berlin, 1976). 



151 



[11] V.I. Arnol'd, S.M. Gusein-Sade, A.N. Varchenko Singularities of Differentiable 
Maps, vol. 1 (Birkhauser, Boston, 1985). 



12 
13 
14 
15 
16 
17 
18 
19 

20 
21 
22 
23 

24 
25 
26 

27; 

28 
29 

30 
31 

32 
33 
34 
35 
36 



A. Babul, Astrophys. J., 349 (1990) 429. 

A. Babul and G.D. Starkman, Astrophys. J., 401 (1992) 28. 

A. Babul and R. van de Weygaert, CITA preprint, Nature submitted (1994). 

J.S. Bagla and T. Padmanabhan, Mon. Not. R. astr. Soc, 266 (1994) 227. 

N.A. Bahcall, Ann. Rev. Astron. Astrophys. 26 (1988) 631. 

N.A. Bahcall and R.M. Soneira, Astrophys. J., 270 (1983) 20. 

N.A. Bahcall and M.J. West, Astrophys. J., 392 (1992) 419. 

R. Balescu, Equilibrium and N on- equilibrium Statistical Mechanics (Wiley, 
New York, 1975) 

R. Balian and R. Schaeffer, Astr. Astrophys., 220 (1989) 1. 
R. Balian and R. Schaeffer, Astr. Astrophys., 226 (1989) 373. 
J.M. Bardeen, Phys. Rev. D, 22 (1980) 1882. 

J.M. Bardeen, J.R. Bond, N. Kaiser and A.S. Szalay, Astrophys. J., 304 (1986) 
15. 

J. Barnes and P. Hut, Nature, 324 (1986) 446. 

J.D. Barrow and S.P. Bhavsar, Quart. J. R. astr. Soc, 28 (1987) 109. 

J.D. Barrow, S.P. Bhavsar and D.H. Sonoda, Mon. Not. R. astr. Soc, 210 

(1984) 19P. 

J.D. Barrow, S.P. Bhavsar and D.H. Sonoda, Mon. Not. R. astr. Soc, 216 

(1985) 17. 

J.D. Barrow and P. Saich, Class. Quant. Grav., 10 (1993) 79. 

H. Bateman and A. Erdelyi, Higher Transcendental Functions, Vol. 1 
(McGraw-Hill, New York, 1953). 

CM. Baugh and G. Efstathiou, Mon. Not. R. astr. Soc, 265 (1993) 165. 

CM. Baugh and G. Efstathiou, Mon. Not. R. astr. Soc, 267 (1994) 323. 

D.J. Baumgart and J.N. Fry, Astrophys. J., 375 (1991) 25. 

F. Bernardeau, Astrophys. J., 390 (1992) L61. 

F. Bernardeau, Astrophys. J., 392 (1992) 1. 

F. Bernardeau, Astrophys. J., 427 (1994) 51. 

F. Bernardeau, Astrophys. J., 433 (1994) 1. 



152 



[37] F. Bernardeau and L.A. Kofman, Astrophys. J., in press (1994). 

[38] F. Bernardeau, T.P. Singh, B. Banerjee and S.M. Chitre, Mon. Not. R. astr. 
Soc, 269 (1994) 947. 

[39] E. Bertschinger, Astrophys. J., 268 (1983) 17. 

[40] E. Bertschinger, Astrophys. J. Suppl, 58 (1985) 1. 

[41] E. Bertschinger, Astrophys. J. Suppl, 58 (1985) 39. 

[42] E. Bertschinger, Astrophys. J., 316 (1987) 489. 

[43] E. Bertschinger, in: V.J. Martinez, M. Portilla and D. Saez (eds), New Insights 
into the Universe. Proceedings, Valencia, Spain 1991 (Springer, Berlin, 1992) 
p. 65. 

[44] E. Bertschinger and A. Dekel, Astrophys. J., 336 (1989) L5. 

[45] E. Bertschinger, A. Dekel, S.M. Faber, A. Dressier and D. Burstein, Astrophys. 
J., 364 (1990) 370. 

[46] E. Bertschinger and J.M. Gelb, Computers in Physics, 5 (1991) 164. 

[47] E. Bertschinger and B. Jain, Astrophys. J., 431 (1994) 486. 

[48] E. Bertschinger and A.J.S. Hamilton, Astrophys. J., 435 (1994) 1. 

[49] S. Bharadwaj, Astrophys. J., 428 (1994) 419. 

[50] S.P. Bhavsar and J.D. Barrow, Mon. Not. R. astr. Soc, 205 (1983) 61P. 

[51] S.P. Bhavsar and E.N. Ling, Astrophys. J., 331 (1988) L63. 

[52] S. Bildhauer, T. Buchert and M. Kasai, Astron. Astrophys., 263 (1992) 23. 

[53] J. Binney and S. Tremaine, Galactic Dynamics (Princeton University Press, 
Princeton, 1987). 

[54] O. Blaes, J.V. Villumsen and P. Goldreich, Astrophys. J., 361 (1990) 331. 

[55] G.R. Blumenthal, S.M. Faber, J.R. Primack and M.J. Rees, Nature, 311 (1984) 
517. 

[56] G.R. Blumenthal, L.N. da Costa, D.S. Goldwirth, M. Lecar and T. Piran, 
Astrophys. J., 388 (1992) 234. 

[57] H. Bohringer and G. Wiedenmann, in: V.J. Martinez, M. Portilla and D. 

Saez (eds), New Insights into the Universe. Proceedings, Valencia, Spain 1991 
(Springer, Berlin, 1992) p. 127. 

[58] J.R. Bond, in: M. Sasaki (ed), Relativistic Cosmology, (Academic Press 1994), 
p. 23. 

[59] J.R. Bond, S. Cole, G. Efstathiou and N. Kaiser, Astrophys. J., 379 (1991) 
440. 



153 



[60] J.R. Bond and H.M.P. Couchman, in: A. A. Coley, C.C. Dyer and B.O.J. 
Tupper (eds), Proceedings of the Second Canadian Conference on General 
Relativity and Relativistic Astrophysics (World Scientific, Singapore, 1988), 
p. 385. 

[61] J.R. Bond, J. Centrella, A.S. Szalay and J.R. Wilson, Mon. Not. R. astr. Soc, 
210 (1984) 515. 



[62; 

[63 
[64 
[65; 

[66 

[67; 
[68; 

[69 
[70 
[71 
[72 

[73 
[74 

[75; 

[76' 

[77; 

[78; 
[79; 

[80 
[81 

[82; 



J.R. Bond and G. Efstathiou, Astrophys. J., 285 (1984) L45. 

J.R. Bond and A.S. Szalay, Astrophys. J., 277 (1983) 443. 

J.R. Bond, A.S. Szalay and J. Silk, Astrophys. J., 324 (1988) 627. 

J.R. Bond, R. Crittenden, R.L. Davis, G. Efstathiou and P.J. Steinhardt, Phys. 
Rev. Lett. 72 (1994) 13. 

J.R. Bond and S.T. Myers, CITA preprint (1994). 

J.R. Bond and S.T. Myers, CITA preprint (1994). 

J.R. Bond and S.T. Myers, CITA preprint (1994). 

J.R. Bond and S.T. Myers, CITA preprint (1994). 

H. Bondi, Mon. Not. R. astr. Soc, 107 (1947) 410. 

W.B. Bonnor and A. Chamorro, Astrophys. J., 361 (1990) 21. 

S.A. Bonometto, A. Iovino, L. Guzzo, R. Giovanelli and M.P.. Haynes, 
Astrophys. J., 419 (1993), 451. 

S.A. Bonometto, F. Lucchin and S. Matarrese, Astrophys. J., 323 (1987) 19. 
S. Borgani, Physics Reports, to be published (1994). 

S. Borgani, P. Coles and L. Moscardini, Mon. Not. R. astr. Soc, 271 (1994) 
223. 

F.R. Bouchet and L. Hernquist, Astrophys. J. Suppl., 68 (1988) 521. 

F.R. Bouchet and L. Hernquist, Astrophys. J., 400 (1992), 25. 

F.R. Bouchet, R. Juszkiewicz, S. Colombi and R. Pellat, Astrophys. J., 394 
(1992) L5. 

F.R. Bouchet, R. Schaeffer and M. Davis, Astrophys. J., 383 (1991) 19. 

F.R. Bouchet, M.A. Strauss, M. Davis, K.B. Fisher, A. Yahil and J.P. Huchra, 
Astrophys. J., 417 (1993) 36. 

F.R. Bouchet, S. Colombi, E. Hivon and R. Juszkiewicz, Astr. Astrophys. 
submitted (1994). 

R.G. Bower, Mon. Not. R. astr. Soc, 248 (1991) 332. 



154 



[83] R.G. Bower, P. Coles, C.S. Frenk and S.D.M. White, Astrophys. J., 405 (1993) 
403. 

[84] T.G. Brainerd, R.J. Scherrer and J. V. Villumsen, Astrophys. J. 418 (1993) 
570. 

[85] E. Branchini and R.G. Carlberg, Astrophys. J., 434 (1994) 37. 

[86] R. Brandenberger, D. Kaplan and S. Ramsey, Brown University Preprint, 
Brown-HET-922 (1993) 

[87] T.J. Broadhurst, R.S. Ellis, D.C. Koo and A.S. Szalay, Nature, 343 (1990) 
726. 

[88] J.W. Bruce, J. Lond. Math. Soc. (2), 33 (1986) 375. 

[89] M. Bruni, P.K.S. Dunsby and G.F.R. Ellis, Astrophys. J., 395 (1992) 34. 

[90] M. Bruni, S. Matarrese and O. Pantano, Astrophys. J., submitted (1994). 

[91] T. Buchert, Astron. Astrophys., 223 (1989) 9. 

[92] T. Buchert, Mon. Not. R. astr. Soc, 254 (1992) 729. 

[93] T. Buchert, Astron. Astrophys., 267 (1993) L51. 

[94] T. Buchert, Mon. Not. R. astr. Soc, 267 (1994) 811. 

[95] T. Buchert and J. Ehlers, Mon. Not. R. astr. Soc, 264 (1993) 375. 

[96] T. Buchert, A.L. Melott, and A.G. Weiss Astron. Astrophys., 288 (1994) 349. 

[97] J.M. Burgers, The Non-linear Diffusion Equation (Reidel, Dordrecht, 1974). 

[98] D. Burstein, Rep. Prog. Phys., 53 (1990) 421. 

[99] R.G. Carlberg and H.M.P. Couchman, Astrophys. J. 340 (1989) 47. 

[100] D.E. Cartwright and M.S. Longuet-Higgins, Proc. Roy. Soc Lond. Ser. A., 
237 (1956) 212. 

[101] P. Catelan, Mon. Not. R. astr. Soc, submitted (1994). 

[102] P. Catelan, P.Coles, S. Matarrese and L. Moscardini, Mon. Not. R. astr. Soc, 
268 (1994) 966. 

[103] P. Catelan, F. Lucchin, S. Matarrese and L. Moscardini, Mon. Not. R. astr. 
Soc, submitted (1994) 

[104] P. Catelan and L. Moscardini, Astrophys. J. 426 (1994) 14. 

[105] A. Cavaliere and N. Menci, Astrophys. J., 435 (1994) 528. 

[106] R. Cen, Astrophys. J. SuppL, 78 (1992) 341. 

[107] J.M. Centrella and A.L. Melott Nature, 305 (1983) 196. 



155 



[108] A. Chamorro, Astrophys. J. 383 (1991) 51. 

[109] J.M. Cline, H.D. Politzer, S.-J. Rey and M.B. Wise, Comm. Math. Phys., 112 
(1987) 217. 

[110] S. Cole, Astrophys. J., 367 (1991) 45. 

[Ill] S. Cole, A. Aragon-Salamanca, C.S. Frenk, J.F. Navarro and S.E. Zepf, Mon. 
Not. R. astr. Soc, 271 (1994) 781. 

[112] S. Cole, K.B. Fisher and D.H. Weinberg, Mon. Not. R. astr. Soc, 267 (1994) 
785. 

[113] S. Cole and N. Kaiser, Mon. Not. R. astr. Soc, 233 (1988) 637. 

[114] S. Cole and N. Kaiser, Mon. Not. R. astr. Soc, 237 (1989) 1127. 

[115] P.H. Coleman, L. Pietronero and R.H. Sanders, Astr. Astrophys., 200, L32. 

[116] P.H. Coleman and L. Pietronero, Phys. Rep., 213 (1992) 311. 

[117] P. Coles, Mon. Not. R. astr. Soc, 222 (1986) 9P. 

[118] P. Coles, Mon. Not. R. astr. Soc, 234 (1988) 509. 

[119] P. Coles, Mon. Not. R. astr. Soc, 238 (1989) 319. 

[120] P. Coles, Mon. Not. R. astr. Soc, 243 (1990) 171. 

[121] P. Coles, in: E.D. Feigelson and G.J. Babu, Statistical Challenges in Modern 
Astronomy (Springer, New York, 1992) p. 57. 

[122] P. Coles, Mon. Not. R. astr. Soc, 262 (1993) 1065. 

[123] P. Coles and J.D. Barrow, Mon. Not. R. astr. Soc, 228 (1987) 407. 

[124] P. Coles and A.G. Davies, Mon. Not. R. astr. Soc, 264 (1993) 261. 

[125] P. Coles and G.F.R. Ellis, Nature, 370 (1994) 609. 

[126] P. Coles and C.S. Frenk, Mon. Not. R. astr. Soc, 253 (1991) 727. 

[127] P. Coles and B.J.T. Jones, Mon. Not. R. astr. Soc, 248 (1991) 1. 

[128] P. Coles, A.L. Melott and S.F. Shandarin, Mon. Not. R. astr. Soc, 260 (1993) 
765. 

[129] P. Coles, L. Moscardini, M. Plionis, F. Lucchin, S. Matarrese and A. Messina, 
Mon. Not. R. astr. Soc, 260 (1993) 572. 

[130] P. Coles, L. Moscardini, F. Lucchin, S. Matarrese and A. Messina, Mon. Not. 
R. astr. Soc, 264 (1993) 749. 

[131] P. Coles and M. Plionis, Mon. Not. R. astr. Soc, 250 (1991) 75. 

[132] C.A. Collins, R.C. Nichol and S.L. Lumsden, Mon. Not. R. astr. Soc, 254 
(1992) 295. 



156 



[133] S. Colombi, Astrophys. J., 435 (1994) 536. 

[134] H.M.P. Couchman, Mon. Not. R. astr. Soc, 225 (1987) 777. 

[135] H.M.P. Couchman, Mon. Not. R. astr. Soc, 225 (1987) 795. 

[136] H.M.P. Couchman, Astrophys. J., 368 (1991) L23. 

[137] P. Crane and W.C. Saslaw, Astrophys. J., 301 (1986) 1. 

[138] R. Crittenden, J.R. Bond, R.L. Davis, G. Efstathiou and P.J. Steinhardt, Phys. 
Rev. Lett., 71 (1993) 324. 

[139] K.M. Croudace, J. Parry, D.S. Salopek and J.M. Stewart, Astrophys. J, 423 
(1994) 22 

[140] E.L. Crow and K. Shimizu (eds), Lognormal Distributions: Theory and 
Applications (Marcel Dekker, New York, 1988). 

[141] L.N. da Costa, M.J. Geller, P.S. Pellegrini, D.W. Latham, A.P. Fairall, R.D. 
Marzhe, C.N.A. Miller, J. P. Huchra, J.H. Calderon, M. Ramella and M.I. 
Kurtz, Astrophys. J., 424 (1994) LI. 

[142] L.N. da Costa, P.S. Pellegrini, W.L.W. Sargent, J. Tonry, M. Davis, A. Meiksin, 
D.W. Latham, J.W. Menzies and L.A. Coulson, Astrophys. J., 337 (1988) 544. 

[143] L.N. da Costa, in: Cosmic Velocity Fields, Proceedings of the ninth IAP 
Astrophysics meeting; F.R. Bouchet and M. Lachieze-Rey (eds) (Editions 
Frontieres 1994) p. 475. 

[144] L.N. da Costa, M.S. Vogeley, M.J. Geller, J. P. Huchra and C. Park, Astrophys. 
J. Lett, in press (1994). 

[145] G.B. Dalton, G. Efstathiou, S.J. Maddox and W.J. Sutherland, Astrophys. J., 
390 (1992) LI. 

[146] A.G. Davies and P. Coles, Mon. Not. R. astr. Soc, 260 (1993) 553. 

[147] A.G. Davies and P. Coles, Mon. Not. R. astr. Soc, 262 (1993) 591. 

[148] M. Davis, G. Efstathiou, C.S. Frenk and S.D.M. White, 292 (1985) 371. 

[149] M. Davis, J. P. Huchra, D.W. Latham and J.R. Tonry, Astrophys. J., 253 
(1982) 423. 

[150] M. Davis and P.J.E. Peebles, Astrophys. J. Suppl, 34 (1977) 425. 

[151] M. Davis and P.J.E. Peebles, Astrophys. J., 267 (1983) 465. 

[152] R.L. Davis, H.M. Hodges, G.F. Smoot, P.J. Steinhardt and M.S. Turner, Phys. 
Rev. Lett, 69 (1992) 1856. 

[153] A. Dekel, Ann. Rev. Astron. Astrophys., 32 (1994) 371. 

[154] A. Dekel, E. Bertschinger and S.M. Faber, Astrophys. J., 364 (1990) 349. 



157 



[155] A. Dekel, E. Bertschinger, A. Yahil, M.A. Strauss, M. Davis and J.P. Huchra, 
Astrophys. J., 412 (1993) 1. 

[156] A. Dekel, G.R. Blumenthal, J.R. Primack and S. Olivier, Astrophys. J., 338 
(1989) L5. 

[157] A. Dekel and M.J. Rees, Nature, 326 (1987) 455. 

[158] A. Dekel and M.J. West, Astrophys. J., 288 (1985) 411. 

[159] V. de Lapparent, M.J. Geller and J.P. Huchra, Astrophys. J., 302 (1986) LI. 
[160] V. de Lapparent, M.J. Geller and J.P. Huchra, Astrophys. J., 343 (1989) 1. 
[161] A.G. Doroshkevich, Astrofizica, 6 (1970) 320. 
[162] A.G. Doroshkevich, Astrophys. Lett, 14 (1973) 11. 

[163] A.G. Doroshkevich and E.V. Kotok, Mon. Not. R. astr. Soc, 246 (1990) 10. 

[164] A.G. Doroshkevich, V.S. Ryabenkii and S.F. Shandarin, Astrofizika, 9 (1973) 
144. 

[165] A.G. Doroshkevich, S.F. Shandarin and E. Saar, Mon. Not. R. astr. Soc., 246 
(1978) 10. 

[166] A.G. Doroshkevich, E.V. Kotok, I.D. Novikov, A.N. Polyudov, S.F. Shandarin 
and Yu.S. Sigov, Mon. Not. R. astr. Soc., 192 (1980) 321. 

[167] R. Durrer and I. Kovner, Astrophys. J., 356 (1990) 49. 

[168] R. Durrer and N. Straumann, Mon. Not. R. astr. Soc., 242 (1990) 221. 

[169] G. Efstathiou, in: J. A. Peacock, A.F. Heavens and A.T. Davies (eds), Physics 
of the Early Universe. Proceedings of the 36th Scottish Universities Summer 
School in Physics (Adam Hilger, Bristol, 1990) p. 361. 

[170] G. Efstathiou and J.R. Bond, Mon. Not. R. astr. Soc, 218 (1986) 103. 

[171] G. Efstathiou, J.R. Bond and S.D.M. White, Mon. Not. R. astr. Soc, 258 
(1992) IP 

[172] G. Efstathiou, G.B. Dalton, W. Sutherland and S.J. Maddox, Mon. Not. R. 
astr. Soc, 257 (1992) 125. 

[173] G. Efstathiou, M. Davis, C.S. Frenk and S.D.M. White, Astrophys. J. Suppl., 
57 (1985) 241. 

[174] G. Efstathiou and J.W. Eastwood, Mon. Not. R. astr. Soc, 194 (1981) 503. 

[175] G. Efstathiou, S.M. Fall and C. Hogan, Mon. Not. R. astr. Soc, 189 (1979) 
203. 

[176] G. Efstathiou, C.S. Frenk, S.D.M. White and M. Davis, Mon. Not. R. astr. 
Soc, 235 (1988) 715. 



158 



[177] G. Efstathiou, N. Kaiser, W. Saunders, A. Lawrence, M. Rowan-Robinson and 
C.S. Frenk, Mon. Not. R. astr. Soc, 247 (1990) 10R 



[178; 

[179 
[180 

[181 

[182; 
[183 

[184 
[185; 
[186' 
[187; 

[188 
[189 
[190 
[191 
[192 
[193 
[194 

[195 

[196 

[197; 

[198 
[199 
[200 
[201 



G. Efstathiou and M.J. Rees, Mon. Not. R. astr. Soc, 230 (1988) 5P. 

J. Einasto, M. Joeveer and E. Saar, Mon. Not. R. astr. Soc, 193 (1980) 353. 

J. Einasto, J. Einasto, M. Gramann and E. Saar, Mon. Not. R. astr. Soc, 248 
(1991) 593. 

G.F.R. Ellis, in: R.K. Sachs (ed), General Relativity and Cosmology (Academic 
Press, New York, 1971), p. 104. 

G.F.R. Ellis and M. Bruni, Phys. Rev. D, 40 (1989) 1804. 

G.F.R. Ellis and P.K.S. Dunsby, Cape Town preprint, Astrophys. J. submitted 
(1994). 

G. F.R. Ellis and W.J. Stoeger, Class. Qu. Grav., 4 (1987) 1679. 
A.E. Evrard, Mon. Not. R. astr. Soc, 235 (1988) 911. 

A.E. Evrard, F. Summers and M. Davis, Astrophys. J., 422 (1994) 11. 

S.M. Faber and D. Burstein, in: V.C. Rubin and G.V. Coyne (eds), Large-scale 
Motions in the Universe (Princeton University Press, Princeton, 1988) p. 115. 

D. Fargion, Nuovo Cimento B., 77 (1983) 111. 

K. Farrar and A.L. Melott, Computers in Physics, 4 (1990) 185. 

H. A. Feldman and A.E. Evrard, Int. J. Mod. Phys. D., 2 (1993) 113. 
H.A. Feldman, N. Kaiser and J.A. Peacock, Astrophys. J., 426 (1994) 23. 
J.A. Fillmore and P. Goldreich, Astrophys. J., 281 (1984) 1. 

J.A. Fillmore and P. Goldreich, Astrophys. J., 281 (1984) 9. 

K.B. Fisher, M. Davis, M.A. Strauss, A. Yahil and J. P. Huchra, Astrophys. J., 
402 (1993) 42. 

K.B. Fisher, M. Davis, M.A. Strauss, A. Yahil and J.P. Huchra, Mon. Not. R. 
astr. Soc, 267 (1994) 927. 

J.N. Fry, Astrophys. J., 262 (1982) 424. 

J.N. Fry, Astrophys. J., 277 (1984) L5. 

J.N. Fry, Astrophys. J., 279 (1984) 499. 

J.N. Fry, Astrophys. J., 279 (1984) 729. 

J.N. Fry, Astrophys. J., 289 (1985) 10. 

J.N. Fry, Phys. Lett, 158B (1985) 211. 



159 



[202] J.N. Fry, Astrophys. J., 306 (1986) 358. 
[203] J.N. Fry, Astrophys. J., 421 (1994) 21. 

[204] J.N. Fry, R. Giovanelli, M.P. Haynes, A.L. Melott and R.J. Scherrer, Astrophys. 
J., 340 (1989) 11. 

[205] J.N. Fry and A.L. Melott, Astrophys. J., 292 (1985) 395. 

[206] J.N. Fry and R.J. Scherrer, Astrophys. </., to be published (1993) 

[207] J.N. Fry, A.L. Melott and S.F. Shandarin, Astrophys. J., 412 (1993) 504. 

[208] E. Gaztanaga, Astrophys. J., 398 (1992) L17. 

[209] E. Gaztanaga, Mon. Not. R. astr. Soc, 268 (1994) 913. 

[210] E. Gaztanaga and J. A. Frieman, Fermilab preprint 207-A (1994) 

[211] M. Giavalisco, B. Mancinelli, P.J. Mancinelli and A. Yahil, Astrophys. J., 411 
(1993) 9. 

[212] R. Giovanelli and M.P. Haynes, Ann. Rev. Astr. Astrophys., 29 (1991) 499. 

[213] M.H. Goroff, B. Grinstein, S.-J. Rey and M.B. Wise, Astrophys. J., 311 (1986) 
6. 

[214] K. Gorski, Astrophys. J., 332 (1988) L7. 

[215] K. Gorski, M. Davis, M.A. Strauss, S.D.M. White and A. Yahil, Astrophys. J., 
344 (1989) 1. 

[216] K. Gorski, J. Silk and N. Vittorio, Phys. Rev. Lett. 68 (1992) 733. 

[217] K. Gorski, G. Hindshaw, A.J. Banday, C.L. Bennett and E.L. Wright, 
Astrophys. J. Lett. (1994) submitted. 

[218] J.R. Gott, Astrophys. J., 201 (1975) 296. 

[219] J.R. Gott and M.J. Rees, Astr. Astrophys., 45 (1975), 365. 

[220] J.R. Gott, A.L. Melott and M. Dickinson, Astrophys. J., 306 (1986) 341. 

[221] J.R. Gott, S. Mao, C. Park, O. Lahav, Astrophys. J., 385 (1992) 26. 

[222] J.R. Gott, J. Miller, T.X. Thuan, S.E. Schneider, D.H. Weinberg, C. Garrie, 
K. Polk, M. Vogeley, S. Jeffrey, S.P Bhavsar, A.L. Melott, R. Giovanelli, M.P. 
Haynes, R.B. Tully and A.J.S. Hamilton, Astrophys. J., 340 (1989) 625. 

[223] J.R. Gott, C. Park, R. Juszkiewicz, W.E. Bies, D.P.Bennett, F.R. Bouchet and 
A. Stebbins, Astrophys. J., 352 (1990) 1. 

[224] M. Gramann, Astrophys. J., 405 (1993) L47. 

[225] M. Gramann, Astrophys. J., 405 (1993) 449. 

[226] M. Gramann, R. Cen and N.A. Bahcall, Astrophys. J., 419 (1993) 440. 



160 



[227] B. Grinstein and M.B. Wise, Astrophys. J., 320 (1987) 448. 

[228] L.P. Grischuk and Ya.B. Zel'dovich, Sov. Astr., 25 (1981) 267. 

[229] E.J. Groth, R. Juszkiewicz and J. P. Ostriker, Astrophys. J., 346 (1989) 558. 

[230] E.J. Groth and P.J.E. Peebles, Astr. Astrophys., 41 (1975) 143. 

[231] E.J. Groth and P.J.E. Peebles, Astr. Astrophys., 53 (1976) 131. 

[232] E.J. Groth and P.J.E. Peebles, Astrophys. J., 217 (1977) 385. 

[233] S.F. Gull and A.J.M. Garrett, in: G. J. Erickson (ed), Maximum Entropy and 
Bayesian Methods, Seattle 1991 (Kluwer, Dordrecht, 1992). 

[234] J.E. Gunn, Astrophys. J., 218 (1977) 592. 

[235] J.E. Gunn and J.R. Gott, Astrophys. J., 176 (1972) 1. 

[236] S.N. Gurbatov and A.I. Saichev, Sov. Phys. JETP., 53 (1981) 347. 

[237] S.N. Gurbatov, A.I. Saichev and S.F. Shandarin, Sov. Phys. Dokl., 30 (1985) 
921. 

[238] S.N. Gurbatov, A.I. Saichev and S.F. Shandarin, Mon. Not. R. astr. Soc, 236 
(1989) 385. 

[239] S.N. Gurbatov, A.I. Saichev and I.G. Yakushkin, Sov. Phys. Usp., 26 (1983) 
857. 

[240] M. Guyot and Ya.B. Zel'dovich, Astr. Astrophys., 9 (1970) 227. 

[241] R. Guzman and J.R. Lucey, Mon. Not. R. astr. Soc., 263 (1993) L47. 

[242] A.J.S. Hamilton, Astrophys. J., 332 (1988) 67. 

[243] A.J.S. Hamilton, Astrophys. J., 385 (1992) L5. 

[244] A.J.S. Hamilton, Astrophys. J., 406 (1993) L47. 

[245] A.J.S. Hamilton, Astrophys. J., 417 (1993) 19. 

[246] A.J.S. Hamilton, J.R. Gott and D. Weinberg, Astrophys. J., 309 (1986) 1. 

[247] A.J.S. Hamilton, P. Kumar, E. Lu and A. Mathews, Astrophys. J., 374 (1991) 
LI. 

[248] D. Hansel, F.R. Bouchet, R. Pellat and A. Romani, Astrophys. J., 310 (1986), 
23. 

[249] E. Harrison, Phys. Rev. D., 1 (1970) 2726. 

[250] M.A. Hausman, DW. Olson and B.D. Roth, Astrophys. J., 270 (1983) 351. 

[251] SW. Hawking, Astrophys. J., 145 (1966) 544. 

[252] M.P. Haynes, Ann. Rev. Astr. Astrophys., 29 (1991) 499. 



161 



[253] M.P. Haynes and R. Giovanelli, in: V.C. Rubin and G.V. Coyne (eds), Large- 
scale Motions in the Universe (Princetion University Press, Princeton, 1988) 
p. 31. 

[254] D.J. Heath, Mon. Not. R. astr. Soc, 179 (1977) 351. 

[255] A.F. Heavens and A.N. Taylor, Mon. Not. R. astr. Soc., in press (1995). 

[256] J. P. Henry and K.A. Arnoud, Astrophys. J., 372 (1991) 410. 

[257] L. Hernquist and N. Katz, Astrophys. J. Suppl, 70 (1989) 419. 

[258] E. Hivon, F.R. Bouchet, S. Colombi and R. Juszkiewicz, Astr. Astrophys. 
Sunmitted (1994). 

[259] R.W. Hockney and J.W. Eastwood, Computer Simulation Using Particles 
(McGraw-Hill, New York, 1981). 

[260] G.L. Hoffman, E.E. Salpeter and I. Wasserman, Astrophys. J., 268 (1983) 527. 

[261] Y. Hoffman, Astrophys. J., 308 (1986) 493. 

[262] Y. Hoffman, Astrophys. J., 340 (1989) 69. 

[263] Y. Hoffman and S. Bludman, Phys. Rev. Lett. 52 (1984) 1087. 
[264] J. Holtzman, Astrophys. J. Suppl, 71 (1989) 1. 
[265] E. Hubble, Astrophys. J., 79 (1934) 8. 

[266] J.-C. Hwang and E. T. Vishniac, Astrophys. J., 353 (1990) 1. 

[267] V. Icke, Astr. Astrophys., 27 (1973) 1. 

[268] V. Icke, Mon. Not. R. astr. Soc., 206 (1984) 1. 

[269] V. Icke and R. van de Weygaert, Astr. Astrophys., 184 (1987) 16. 

[270] S. Ikeuchi, K. Tomisaka and J. P. Ostriker, Astrophys. J., 265 (1983) 583. 

[271] S. Ikeuchi and M. Umemura, Prog. Theor. Phys., 72 (1984) 216. 

[272] M.B. Isichenko, Rev. Mod. Phys., 64 (1992) 961. 

[273] M. Itoh, S. Inagaki and W.C. Saslaw, Astrophys. J., 403 (1993) 476. 

[274] B. Jain and E. Bertschinger, Astrophys. J., 431 (1994) 495. 

[275] K. Janes and P. Demarque, Astrophys. J., 264 (1983) 206. 

[276] L.G. Jensen and A.S. Szalay, Astrophys. J., 305 (1986) L5. 

[277] M. Joeveer, J. Einasto and E. Tago, Mon. Not. R. astr. Soc, 185 (1978) 357. 

[278] W.A. Johnson and R.F. Mehl, Trans. Am. Inst. Min. Metal. Eng., 135 (1939) 
416. 



162 



[279] B.J.T. Jones, P. Coles and V.J. Martinez, Mon. Not. R. astr. Soc, 259 (1992) 
146. 

[280] B.J.T. Jones, V. J. Martinez, E. Saar and J. Einasto, Astrophys. J., 332 (1988) 
LI. 

[281] R. Juszkiewicz, Mon. Not. R. astr. Soc, 197 (1981) 931. 

[282] R. Juszkiewicz and F.R. Bouchet, in: Proceedings of the Second DAEC Meeting, 
Meudon, France eds. G. Mamon and D. Gerbal, (1991). 

[283] R. Juszkiewicz, F.R. Bouchet and S. Colombi, Astrophys. J., 412 (1993) L9. 

[284] R. Juszkiewicz, D.H. Weinberg, P. Amsterdamski, M. Chodorowski and F.R. 
Bouchet, preprint IASSNS-AST 93/50 (1993). 

[285] R. Juszkiewicz, D.H. Sonoda and J.D. Barrow, Mon. Not. R. astr. Soc, 209 
(1984) 139. 

[286] N. Kaiser, Astrophys. J., 284 (1984) L9. 

[287] N. Kaiser, Mon. Not. R. astr. Soc, 227 (1987) 1. 

[288] N. Kaiser, Mon. Not. R. astr. Soc, 231 (1988) 149. 

[289] N. Kaiser, in: M. Mezetti, G. Giuricin, F. Mardirossian and M. Ramella (eds), 
Large-scale Structure and Motions in the Universe (Kluwer, Dordrecht, 1989) 
p. 197. 

[290] N. Kaiser, in: D.W. Latham and L.N. da Costa (eds) Proceedings of the 
Texas /ESO-CERN Symposium on Relativistic Astrophysics, Cosmology and 
Fundamental Physics, Annals of the New York Academy of Sciences, 647 
(1991) p. 295. 

[291] N. Kaiser, G. Efstathiou, R.S. Ellis, C.S. Frenk, A. Lawrence, M. Rowan- 
Robinson and W. Saunders, Mon. Not. R. astr. Soc, 252 (1991) 1. 

[292] H. Kang, J. P. Ostriker, R. Cen, D. Ryu, L. Hernquist, A.E. Evrard, G.L. Bryan 
and M.L. Norman, Astrophys. J., 430 (1994) 83. 

[293] A. Kashlinsky, Astrophys. J., 317 (1987) 19. 

[294] A. Kashlinsky, in: D. W. Latham and L.N. da Costa (eds), Large-scale 
Structures and Peculiar Motions in the Universe, A. S. P. Conference Series 
15 (1991) p. 103 

[295] N. Katz and J.E. Gunn, Astrophys. J., 377 (1991) 365. 

[296] N. Katz, L. Hernquist and D.H. Weinberg, Astrophys. J., 399 (1992) 109. 

[297] N. Katz, T. Quinn and J.M. Gelb, Mon. Not. R. astr. Soc, 265 (1993) 689. 

[298] N. Katz and S.D.M. White, Astrophys. J., 412 (1993) 455. 

[299] G. Kauffman and A.P Fairall, Mon. Not. R. astr. Soc, 248 (1991) 313. 



163 



[300] G. Kauffman, B. Guiderdoni and S.D.M. White, Mon. Not. R. astr. Soc, 267 
(1994) 981. 

[301] G. Kauffman and A.L. Melott, Astrophys. J., 393 (1992) 415. 

[302] G. Kauffman and S.D.M. White, Mon. Not. R. astr. Soc, 261 (1993) 921. 

[303] G. Kauffman, S.D.M. White and B. Guiderdoni, Mon. Not. R. astr. Soc, 264 
(1993) 201. 

[304] M. Kendali and A. Stuart, The Advanced Theory of Statistics, Volume 1, 
(Griffin & Co., London, 1977). 

[305] T. Kiang, Zeitschr. f. Astrophys., 64 (1966) 433. 

[306] S. Kida, J. Fluid Mech., 93 (1979) 337. 

[307] J.C. Kirkwood, J. Chem. Phys., 3 (1935) 300. 

[308] R.P. Kirshner, A. Oemler, P.L. Schechter and S.A. Schechtman, Astrophys. J., 
248 (1981) L57. 

[309] R.P. Kirshner, A. Oemier, P.L. Schechter and S.A. Schechtman, Astr. J., 88 
(1983) 1285. 

[310] A. A. Klypin, J. Holtzman, J.R. Primack and E. Regos, Astrophys. J., 416 
(1993) 1. 

[311] A. A. Klypin and S.F. Shandarin, Mon. Not. R. astr. Soc, 204 (1983) 891. 

[312] A. A. Klypin and S.F. Shandarin, Astrophys. J., 413 (1993) 48. 

[313] H. Kodama and M. Sasaki, Prog. Theor. Phys. Suppl., 78 (1984) 1. 

[314] H. Kodama and M. Sasaki, Int. J. Mod. Phys. A., 2 (1987) 491. 

[315] L.A. Kofman, in: P. Flin and HW. Duerbeck (eds), Morphological Cosmology. 
Proceedings of 11th Krakow Cosmological School (Springer, Berlin, 1989) p. 
354. 

[316] L.A. Kofman, in: K. Sato (ed), Proc. IUAP Proceedings on Nucleosynthesis in 
the Universe (Kluwer, Dordrecht, 1991) 495. 

[317] L.A. Kofman, E. Bertschinger, J.M. Gelb, A. Nusser and A. Dekel, Astrophys. 
J., 420 (1994) 44. 

[318] L.A. Kofman, N.Y. Gnedin and N.A. Bahcall, Astrophys. J., 413 (1993) 1. 

[319] L.A. Kofman and D.Yu. Pogosyan, SISSA preprint: astroph/9403029 (1994). 

[320] L.A. Kofman, D.Yu. Pogosyan and S.F. Shandarin, Mon. Not. R. astr. Soc, 
242 (1990) 200. 

[321] L.A. Kofman, D.Yu. Pogosyan, S.F. Shandarin and A.L. Melott, Astrophys. 
J., 393 (1992) 437. 



164 



[322] L.A. Kofman and S. F. Shandarin, Nature, 334 (1988) 129. 

[323] L.A. Kofman and A. A. Starobinsky, Sov. Astron. Lett. 11 (1985) 271. 

[324] E.W. Kolb and M.S. Turner, The Early Universe (Addison-Wesley, Chicago, 
1990). 

[325] J.R. Kuhn and J.M. Uson, Astrophys. J., 263 (1982) L47. 

[326] C.G. Lacey and S. Cole, Mon. Not. R. astr. Soc, 262 (1993) 627. 

[327] C.G. Lacey, B. Guiderdoni, B. Rocca-Volmerange, and J. Silk, Astrophys. J., 
402 (1993) 15. 

[328] C.G. Lacey and J. Silk, Astrophys. J., 381 (1991) 14. 
[329] M. Lachieze-Rey, Astrophys. J., 408 (1993) 403. 

[330] O. Lahav, P.B. Lilje, J.R. Primack and M.J. Rees, Mon. Not. R. astr. Soc., 
251 (1991) 128. 

[331] O. Lahav, M. Itoh, S. Inagaki and Y. Suto, Astrophys. J., 402 (1993) 387. 
[332] K. Lake and R. Pim, Astrophys. J., 298 (1985) 439. 

[333] A. Lawrence, M. Rowan-Robinson, J. Crawford, I. Parry, X.-Y. Xia, R.S. Ellis, 
C.S. Frenk, W. Saunders, G. Efstathiou and N. Kaiser, Mon. Not. R. astr. Soc., 
to be published (1995). 

[334] A. Lemaitre, Mon. Not. R. astr. Soc., 91 (1931) 483. 

[335] A.R. Liddle and D.H. Lyth, Phys. Lett. B., 291 (1992) 391. 

[336] A.R. Liddle and D.H. Lyth, Mon. Not. R. astr. Soc, 265 (1993) 379. 

[337] A.R. Liddle and D.H. Lyth, Phys. Reports, 231 (1993) 1. 

[338] J.E. Lidsey and P. Coles, Mon. Not. R. astr. Soc, 258 (1992) 57P. 

[339] E.M. Lifshitz, Zh. Eksp. Teoret. Fiz. 16 (1946) 587. 

[340] E.M. Lifshitz and I.M. Khalatnikov, Sov. Phys. Usp., 6 (1964) 495. 

[341] P.B. Lilje and G. Efstathiou, Mon. Not. R. astr. Soc, 231 (1988) 635. 

[342] D.N. Limber, Astrophys. J., 119 (1954) 665. 

[343] C.C. Lin, L. Mestel and F.H. Shu, Astrophys. J., 142 (1965) 1431. 

[344] E.N. Ling, C.S . Frenk and J.D. Barrow, Mon. Not. R. astr. Soc, 223 (1986) 
2 IP. 

[345] B. Little and D.H. Weinberg, Mon. Not. R. astr. Soc, 267 (1994) 605. 

[346] E.L. Lokas, R. Juszkiewicz, D.H. Weinberg and F.R. Bouchet, Mon. Not. R. 
astr. Soc, submitted (1994). 



165 



[347] J. Loveday, G. Efstathiou, B.A. Paterson and S.J. Maddox, Astrophys. J., 400 

(1992) L43. 

[348] F. Lucchin, S. Matarrese and S. Mollerach, Astrophys. J., 401 (1992) L49. 

[349] F. Lucchin, S. Matarrese, A.L. Melott and L. Moscardini, Astrophys. J., 422 
(1994) 430. 

[350] S. L. Lumsden, A. F. Heavens and J. A. Peacock, Mon. Not. R. astr. Soc, 
238 (1989) 293. 

[351] X. Luo, Phys. Rev. D, 49 (1994) 3810. 

[352] X. Luo and D.N. Schramm, Phys. Rev. Lett, 71 (1993) 1124. 
[353] D. Lynden-Bell, Astrophys. J., 139 (1964) 1195. 

[354] D. Lynden-Bell, S. M. Faber, D. Burstein, R. L. Davies, A. Dressier, R. J. 
Terlevich and G. Wegner, Astrophys. J., 326 (1988) 19. 

[355] D. Lynden-Bell, O. Lahav and D. Burstein, Mon. Not. R. astr. Soc., 241 
(1989) 325. 

[356] S. J. Maddox, G. Efstathiou, W. J. Sutherland and J. Loveday, Mon. Not. R. 
astr. Soc., 242 (1990) 43P. 

[357] K. Maeda and H. Sato, Prog. Theor. Phys., 70 (1983) 772. 

[358] K. Maeda, M. Sasaki and H. Sato, Prog. Theor. Phys., 69 (1982) 89. 

[359] N. Makino, M. Sasaki and Y. Suto, Phys. Rev. D., 46 (1992) 585. 

[360] B. B. Mandelbrot, Fractals: Form, Chance, Dimension (Freeman, San 
Francisco, 1977). 

[361] B. B. Mandelbrot, The Fractal Geometry of Nature (Freeman, New York, 
1982). 

[362] R. G. Mann, A. F. Heavens and J. A. Peacock, Mon. Not. R. astr. Soc, 263 

(1993) 798. 

[363] S. Mao and C.S. Kochanek, Mon. Not. R. astr. Soc, 268 (1994) 569. 

[364] D. Maoz and H.-W. Rix, Astrophys. J., 416 (1993) 425. 

[365] H. Martel, Astrophys. J., 377 (1991) 7. 

[366] H. Martel and W. Freudling, Astrophys. J., 371 (1991) 1. 

[367] H. Martel and I. Wasserman, Astrophys. J., 348 (1990) 1. 

[368] V. J. Martinez and P. Coles, Astrophys. J., 437 (1994) 550. 

[369] V. J. Martinez and B. J. T. Jones, Mon. Not. R. astr. Soc, 242 (1990) 517. 

[370] V. J. Martinez, B. J. T. Jones, R. Dominguez-Tenreiro and R. van de 
Weygaert, Astrophys. J., 357 (1990) 50. 



166 



[371] S. Matarrese, F. Lucchin, L. Moscardini and D. Saez, Mon. Not. R. astr. Soc, 
259 (1992) 437. 

[372] S. Matarrese, O. Pantano and D. Saez, Phys. Rev. D, 47 (1993) 1311. 

[373] S. Matarrese, O. Pantano and D. Saez, Phys. Rev. Lett. 72 (1994) 320. 

[374] S. Matarrese, O. Pantano and D. Saez, Mon. Not. R. astr. Soc., 271 (1994) 
513. 

[375] J. C. Mather, E. S. Cheng, R. E. Eplee, R. B. Isaacman, S. S. Meyer, R. A. 
Shafer, R. Weiss, E. L. Wright, C. L. Bennett, N. W. Boggess, E. Dwek, S. 
Gulkis, M. G. Hauser, M. Janssen, T. Kelsall, P. M. Lubin, S. H. Moseley Jr, 
T. L. Murdock, R. F. Silverberg, G. F. Smoot and D. T. Wilkinson, Astrophys. 
J., 354 (1990) L37. 



[376; 
[377 
[378; 
[379 
[380 
[381 
[382; 
[383; 
[384 
[385; 
[386 

[387 
[388; 
[389 

[390 
[391 

[392 
[393 
[394 



J. C. Mather et al. Astrophys. J., 420 (1994) 439. 

T. Matsuda and E. Shima, Prog. Theor. Phys., 71 (1984) 855. 

S. Maurogordato and M. Lachieze-Rey, Astrophys. J., 320 (1987) 13. 

S. Maurogordato and M. Lachieze-Rey, Astrophys. J., 369 (1991) 30. 

P. Meakin, Phys. Rev. A , 36 (1987) 2833. 

K. R. Mecke, T. Buchert and H. Wagner, Astr. Astrophys., 288 (1994) 697. 

A. Meiksin, I. Szapudi and A. S. Szalay, Astrophys. J., 384 (1992) 87. 

A. L. Melott, Physics Reports, 193 (1990) 1. 

A. L. Melott, Comments on Astrophysics, 15 (1990) 1. 

A. L. Melott and J.L. Pauls, Phys. Rev. Lett. (1994) submitted. 

A. L. Melott, T. Pellman and S. F. Shandarin, Mon. Not. R. astr. Soc, 269 
(1994) 626. 

A. L. Melott and S. F. Shandarin, Astrophys. J., 342 (1989) 26. 

A.L. Melott and S.F. Shandarin, Nature 346 (1990) 633. 

A.L. Melott, F. Lucchin, S. Matarrese, and L. Moscardini, Mon. Not. R. astr. 
Soc, 268 (1994) 69. 

A. L. Melott, T. Buchert and A. G. Weiss, Astr. Astrophys. In Press. 

A.L. Melott, A.P. Cohen, A.J.S. Hamilton, J.R. Gott and D.H. Weinberg, 
Astrophys. J., 345 (1989) 618. 

A.L. Melott, S.F. Shandarin and D.H. Weinberg, Astrophys. J. 428 (1994) 28. 
P. Meszaros, Astr. Astrophys., 37 (1974) 225. 
J. L. Meyerling, Philips Res. Rept. 8 (1953) 270. 



167 



[395] R. E. Miles, Math. Biosci., 6 (1970) 270. 
[396] J. Moller, Adv. Appl. Prob. , 21 (1989) 37. 

[397] J. E. Moody, E. L. Turner and J. R. Gott, Astrophys. J., 273 (1983) 16. 

[398] B. Moore, C. S. Frenk, D. H. Weinberg, W. Saunders, A. Lawrence, M. Rowan- 
Robinson, N. Kaiser, G. Efstathiou and R. S. Ellis, Mon. Not. R. astr. Soc, 
256 (1992) 477. 

[399] J.G. More, A.F. Heavens and J.A. Peacock, Mon. Not. R. astr. Soc, 220 
(1986) 189. 

[400] L. Moscardini, S. Matarrese, F. Lucchin, and A. Messina, Mon. Not. R. astr. 
Soc, 248 (1991) 424. 

[401] F. Moutarde, J. M. Alimi, F. R. Bouchet, R. Pellat and A. Ramani, Astrophys. 
J., 382 (1991) 377. 

[402] V.F. Mukhanov, H.A. Feldman and R.H. Brandenberger, Phys. Rep., 215 
(1992) 203. 

[403] D. Munshi, V. Sahni and A. A. Starobinsky, Astrophys. J., 436 (1994) 517. 

[404] D. Munshi, T. Souradeep and A. A. Starobinsky, Astrophys. J., submitted 
(1995). 

[405] D. Munshi and A. A. Starobinsky, Astrophys. J., 428 (1994) 433. 

[406] H. Nariai and M. Fujimoto, Prog. Theor. Phys., 47 (1972) 105. 

[407] J.F. Navarro and W. Benz, Astrophys. J., 380 (1991) 320. 

[408] J.F. Navarro and S.D.M. White, Mon. Not. R. astr. Soc, 265 (1993) 271. 

[409] R. P. Nelson and J. C. B. Papaloizou, Mon. Not. R. astr. Soc, 265 (1993) 
905. 

[410] R.C. Nichol, C.A. Collins, L. Guzzo and S.L. Lumsden, Mon. Not. R. astr. 
Soc, 255 (1992) 21P. 

[411] R. Nityananda and T. Padmanabhan, Mon. Not. R. astr. Soc, 271 (1994) 
976. 

[412] A. Nusser and A. Dekel, Astrophys. J., 362 (1990) 14. 
[413] A. Nusser and A. Dekel, Astrophys. J., 391 (1992) 443. 
[414] A. Nusser and A. Dekel, Astrophys. J., 405 (1993) 437. 

[415] A. Nusser, A. Dekel, E. Bertschinger and G. R. Blumenthal, Astrophys. J., 
376 (1991) 6. 

[416] F. Occhionero, P. Santangelo and N. Vittorio, Astron. Astrophys. 117 (1983) 
365. 



168 



[417] K. A. Olive, G. Steigman and T. P. Walker, Astrophys. J., 380 (1991) LI. 

[418] S. Olivier, G. R. Blumenthal, A. Dekel, J. R. Primack and D. Stanhill, 
Astrophys. J., 356 (1990) 1. 

[419] S. Otto, H. D. Politzer, J. Preskill and M. B. Wise, Astrophys. J., 304 (1986) 
62. 

[420] T. Padmanabhan and D. Narasimha, Mon. Not. R. astr. Soc, 259 (1992) 41p. 

[421] T. Padmanabhan and K. Subramanian, Astrophys. J., 410 (1993) 482. 

[422] T. Padmanabhan, Structure formation in the Universe (Cambridge University 
Press, Cambridge, 1993). 

[423] G. Paladin and A. Vulpiani, Physics Reports, 156 (1987) 147. 

[424] M. Panek, Mon. Not. R. astr. Soc, 216 (1985) 85. 

[425] C. Park, J. R. Gott, A. L. Melott and I. D. Kharachentsev, Astrophys. J., 387 
(1992) 1. 

[426] C. Park, M.S. Vogeley, M.J. Geller and J. P. Huchra, Astrophys. J., 431 (1994) 
569. 

[427] R.B. Partridge and P.J.E. Peebles, Astrophys. J., 148 (1967) 377. 
[428] J. L. Pauls and A.L. Melott, Mon. Not. R. astr. Soc. in press (1995). 
[429] J.A. Peacock, Mon. Not. R. astr. Soc, 253 (1991) IP. 

[430] J.A. Peacock, in: V.J. Martinez, M.Portilla and D. Saez (eds), New Insights 
into the Universe. Proceedings, Valencia, Spain 1991 (Springer, Berlin, 1992) 
p. 1. 

[431] J.A. Peacock and S. Dodds, Mon. Not. R. astr. Soc, 267 (1994) 1020. 
[432] J.A. Peacock and A.F. Heavens, Mon. Not. R. astr. Soc, 217 (1985) 805. 
[433] J.A. Peacock and A.F. Heavens, Mon. Not. R. astr. Soc, 243 (1990) 133. 
[434] J.A. Peacock and D. Nicholson, Mon. Not. R. astr. Soc, 253 (1991) 307. 
[435] J.A. Peacock and M.J. West, Mon. Not. R. astr. Soc, 259 (1992) 494. 
[436] R.C. Pearson and P. Coles, Mon. Not. R. astr. Soc, 272 (1995) 231. 
[437] P.J.E. Peebles, Astrophys. J., 147 (1967) 859. 
[438] P.J.E. Peebles, Astrophys. J., 189 (1974) L51. 
[439] P.J.E. Peebles, Astr. Astrophys., 32 (1974) 197. 

[440] P.J.E. Peebles, The Large-scale Structure of the Universe (Princeton 
University Press, Princeton, 1980). 

[441] P.J.E. Peebles, Astrophys. J., 257 (1982) 438. 



169 



[442] P.J.E. Peebles, Astrophys. J., 258 (1982) 415. 
[443] P.J.E. Peebles, Astrophys. J., 263 (1982) LI. 
[444] P.J.E. Peebles, Astrophys. J., 284 (1984) 439. 
[445] P.J.E. Peebles, Astrophys. J., 315 (1987) L73. 
[446] P.J.E. Peebles, Astrophys. J., 362 (1990) 1. 

[447] P.J.E. Peebles, Principles of Physical Cosmology (Princeton University Press, 
Princeton, 1993). 

[448] P.J.E. Peebles, Astrophys. J., 429 (1994) 43. 

[449] P.J.E. Peebles and E.J. Groth, Astrophys. J., 196 (1975) 1. 

[450] L. Perivolaropoulos, R. Brandenberger and A. Stebbins, Phys. Rev. D, 41 
(1990) 1764. 

[451] V. Petrosian, E.E. Salpeter and P. Szekeres, Astrophys. J., 147 (1967) 1222. 

[452] L. Pietronero, Physica, 144A (1987) 257. 

[453] R. Pirn and K. Lake, Astrophys. J., 304 (1986) 75. 

[454] R. Pirn and K. Lake, Astrophys. J., 330 (1988) 625. 

[455] M. Plionis, R. Valdarnini and P. Coles, Mon. Not. R. astr. Soc, 258 (1992) 
114. 

[456] D. Yu Pogosyan, Tartu Preprint, Estonian Acad. Sci. (1989). 

[457] D. Yu Pogosyan and A. A. Starobinsky, Mon. Not. R. astr. Soc, 265 (1993) 
507. 

[458] H.D. Politzer and J. Preskill, Phys. Rev. Lett, 56 (1986) 99. 

[459] H.D. Politzer and M.B. Wise, Astrophys. J., 285 (1984) LI. 

[460] M. Postman, M.J. Geller and J.P. Huchra, Astr. J., 91 (1986) 1267. 

[461] W.H. Press and P.L. Schechter, Astrophys. J., 187 (1974) 425. 

[462] J.R. Primack and G.R. Blumenthal, in: F. Mardirossian, G. Giuricin and M. 
Mezzetti (eds), Clusters and Groups of Galaxies, (Reidel, Dordrecht, 1984) p. 
435. 

[463] J.R. Primack, J. Holtzman, A. Klypin and D.O. Caldwell, Phys. Rev. Lett., in 
press (1995). 

[464] E. Regos and M.J. Geller, Astrophys. J., 377 (1991) 14. 
[465] S.O. Rice, Bell Systems Tech. Journal, 24 (1945) 46. 

[466] S.O. Rice, in: N. Wax (ed), Selected Papers on Noise and Stochastic Processes 
(Dover, New York, 1954) p. 133. 



170 



[467] L. Ruamsuwan and J.N. Fry, Astrophys. J., 396 (1992) 416. 

[468] V.A. Rubakov, M.V. Sazhin and A.V. Veryaskin, Phys. Lett. B., 115 (1982) 
189. 

[469] B. Ryden and M. Gramann, Astrophys. J., 383 (1991) L33. 

[470] D. Ryu, J. P. Ostriker, H. Kang and R. Cen, Astrophys. J., 414 (1993) 1. 

[471] R.K. Sachs and A.M. Wolfe, Astrophys. J., 147 (1967) 73. 

[472] V. Sahni, Pramana, 15 (1980) 423. 

[473] V. Sahni, One-loop Quantum Gravitational Effects and the Growth of Density 
Perturbations in Cosmological Models, (PhD Thesis, Moscow State University, 
1984) 

[474] V. Sahni, in: Proceedings of Texas /ESO-CERN Symposium on Relativistic 
Astrophysics, Cosmology and Fundamental Physics, Annals of the New York 
Academy of Sciences, 647 (1991) 749. 

[475] V. Sahni, H. Feldman and A. Stebbins, Astrophys. J., 385 (1992) 1. 

[476] V. Sahni, B.S. Sathyaprakash and S.F. Shandarin, Astrophys. J., 431 (1994) 
20. 

[477] V. Sahni, B.S. Sathyaprakash and D. Pogosyan, In preparation (1995). 

[478] V. Sahni, B.S. Sathyaprakash and S.F. Shandarin, In preparation (1995). 

[479] V. Sahni and T. Souradeep, in: M. C. Bento, O. Bertolami, JM. Mourao and 
R. F. Picken (eds), Classical and Quantum Gravity: Proceedings of the First 
Iberian Meeting on Gravity, Evora, Portugal 21 - 26 September 1992 (World 
Scientific, Singapore, 1993) p. 256. 

[480] J.K. Salmon, "Parallel Hierarchical N-body Methods", PhD thesis, Caltech 
(1991) Unpublished. 

[481] J.K. Salmon and M.S. Warren, J. Comp. Phys. , in press (1994). 

[482] D.S. Salopek, Phys. Rev. Lett, 69 (1992) 3602. 

[483] D.S. Salopek, J.M. Stewart and K.M. Croudace, Mon. Not. R. astr. Soc, 271 
(1994) 1005. 

[484] D.S. Salopek and J.M. Stewart, Phys. Rev. D, 51 (1995) 517. 
[485] W.C. Saslaw, Astrophys. J., 235 (1980) 299. 

[486] W.C. Saslaw, Gravitational Physics of Stellar and Galactic Systems 
(Cambridge University Press, Cambridge, 1985). 

[487] W. C. Saslaw, Astrophys. J., 304 (1986) 11. 

[488] W.C. Saslaw, Astrophys. J., 341 (1989) 588. 



171 



[489] W.C. Saslaw, Astrophys. J., 391 (1991) 423. 

[490] W.C. Saslaw, S.M. Chitre, M. Itoh and S. Inagaki, Astrophys. J., 365 (1990) 
419. 

[491] W.C. Saslaw and A.J.S. Hamilton, Astrophys. J., 276 (1984) 13. 
[492] W.C. Saslaw and R.K. Sheth, Astrophys. J., 409, 504. 

[493] B. S. Sathyaprakash, V. Sahni, D. Munshi, D. Pogosyan and A. L. Melott, 
Mon. Not. R. ast. Soc. (1995) In press. 

[494] B. S. Sathyaprakash, V. Sahni, S.F. Shandarin and K.B. Fisher, In preparation 
1995. 

[495] H. Sato, Prog. Theor. Phys., 68 (1982) 236. 

[496] H. Sato and K. Maeda, Prog. Theor. Phys., 70 (1983) 119. 

[497] W. Saunders, C.S. Frenk, M. Rowan-Robinson, G. Efstathiou, A. Lawrence, 
N. Kaiser, R.S. Ellis, J. Crawford, X.-Y. Xia and I. Parry, Nature, 349 (1991) 
32. 

[498] R. Schaeffer and J. Silk, Astrophys. J., 292 (1985) 319. 
[499] R. Schaeffer and J. Silk, Astrophys. J., 322 (1988), 1. 
[500] P.L. Schechter, Astrophys. J., 203 (1976) 247. 

[501] R.J. Scherrer, A.L. Melott and S.F. Shandarin, Astrophys. J., 377 (1991) 29. 

[502] J. Schwarz, J. P. Ostriker and A. Yahil, Astrophys. J., 202 (1975) 1. 

[503] L.I. Sedov, Similarity and Dimensional Methods in Mechanics (Academic 
Press, New York, 1959) 

[504] M. Seldner and P.J.E. Peebles, Astrophys. J., 215 (1977) 703. 

[505] S.F. Shandarin, Sov. Astron. Lett, 9 (1983) 104. 

[506] S.F. Shandarin, Kansas University preprint (1994); To appear in: Proceedings 
of the Conference "Cosmic Velocity Fields", Paris, July 1993 

[507] S.F. Shandarin, A.G. Doroshkevich and Ya.B. Zel'dovich, Sov. Phys. Usp., 26 
(1983) 46. 

[508] S.F. Shandarin and Ya.B. Zel'dovich, Comments on Astrophysics, 10 (1983) 
33. 

[509] S.F. Shandarin and Ya.B. Zel'dovich, Rev. Mod. Phys., 61 (1989) 185. 
[510] CD. Shane and C.A. Wirtanen, Publ. Lick. 06a., 22 (1967), part 1. 
[511] T. Shanks, Mon. Not. R. astr. Soc, 186 (1979) 583. 

[512] T. Shanks, D. Hale-Sutton, R. Fong and N. Metcalfe, Mon. Not. R. astr. Soc, 
237 (1989), 589. 



172 



[513] P. Shapiro and K. Struck-Marcell, Astrophys. J. SuppL, 57 (1984) 205. 

[514] N.A. Sharp, S.A. Bonometto and F. Lucchin, Astr. Astrophys., 130 (1984) 79. 

[515] R.K. Sheth, H.J. Mo and W.C. Saslaw, Astrophys. J., 427 (1994) 562. 

[516] J. Silk and N. Vittorio, in: J. Audouze and F. Melchiorri (eds), Confrontation 
between Theories and Observations in Cosmology: Present Status and Future 
Programs. Proceedings of the International School of Physics "Enrico Fermi", 
(North-Holland, Amsterdam, 1990) p. 137. 

[517] V. Silviera and I. Waga, Phys. Rev. D, 50 (1994) 4890. 

[518] G.F. Smoot, C.L. Bennett, A. Kogut, E.L. Wright, J. Aymon, N.W. Boggess, 
E.S. Cheng, G. de Amici, S. Gulkis, M.G. Hauser, G. Hinshaw, P.D. Jackson, 
M. Janssen, E. Kaita, T. Kelsall, P. Keegstra, C. Lineweaver, K. Loewenstein, 
P. Lubin, J. Mather, S.S. Meyer, S.H. Moseley, T. Murdock, L. Rokke, R.F. 
Silverberg, L. Tenorio, R. Weiss and D.T. Wilkinson, Astrophys. J., 396 (1992) 
LI. 

[519] G.F. Smoot, L. Tenorio, A.J. Banday, A. Kogut, E.L. Wright, G. Hinshaw and 
C.L. Bennett, Astrophys. J., submitted (1993). 

[520] J. Soda and Y. Suto, Astrophys. J., 396 (1992) 379. 

[521] L.V. Solov'eva and I.S. Nurgaliev, Sov. Astr., 29 (1985) 267. 

[522] L.V. Solov'eva and A. A. Starobinsky, Sov. Astr., 29 (1985) 367. 

[523] T. Souradeep and V. Sahni, Mod. Phys. Lett, A7 (1992) 3541. 

[524] A. A. Starobinsky, Sov. Astr. Lett., 11 (1985) 133. 

[525] A. A. Starobinsky and V. Sahni, in: Modern Theoretical and Experimental 
Problems of General Relativity (MGI Press, Moscow, 1984) p. 77. 

[526] D. Stoyan, W.S. Kendall and J. Mecke, Stochastic geometry and its 
applications, (Akademie-Verlag, Berlin, 1987) 

[527] M.A. Strauss and M. Davis, in: V. Rubin and G. V. Coyne (eds), Large-scale 
Motions in the Universe (Princeton University Press, Princeton, 1988) p. 256. 

[528] M.A. Strauss, M. Davis, A. Yahil and J. P. Huchra, Astrophys. J., 385 (1992) 
421. 

[529] M.F. Struble and H.J. Rood, Astrophys. J. SuppL, 77 (1991) 263. 

[530] M.P. Susperregi and J.J. Binney, Mon. Not. R. astr. Soc, 271 (1994) 719. 

[531] W.J. Sutherland, Mon. Not. R. astr. Soc, 234 (1988) 159. 

[532] W.J. Sutherland and G. Efstathiou, Mon. Not. R. astr. Soc, 248 (1991) 159. 

[533] Y. Suto, M. Itoh and S. Inagaki, Astrophys. J., 350 (1990) 492. 

[534] Y. Suto and M. Sasaki, Phys. Rev. Letters, 66 (1991) 264. 



173 



[535] A.S. Szalay in: Large Scale Structures in the Universe, Seventeenth advanced 
course of the Swiss society of Astronomy and Astrophysics, Saas-Fee (Geneva 
Observatory, Switzerland, 1987). 

[536] I. Szapudi and A.S. Szalay, Astrophys. J., 403 (1993) 43. 

[537] I. Szapudi, A.S. Szalay and P. Boschan, Astrophys. J., 390 (1992) 350. 

[538] R.C. Tolman, Proc. Nat. Acad. Sci. 20 (1934) 169. 

[539] M. Totsuji and T. Kihara, Pub. Astr. Soc. Jap., 21 (1969) 221. 

[540] R.B. Tully and J.R. Fisher, Nearby Galaxy Atlas (Cambridge University Press, 
Cambridge, 1987). 

[541] E.L. Turner, Astrophys. J., 365 (1990) L43. 

[542] R. van de Weygaert and V. Icke, Astron. Astrophys. 213 (1989) 1. 

[543] R. van de Weygaert and A. Babul, Astrophys. J. Lett. 425 (1994) L59. 

[544] R. van de Weygaert and E. van Kampen, Mon. Not. R. astr. Soc, 263 (1993) 
481. 

[545] R. van de Weygaert, Astron. Astrophys. 283 (1994) 361. 

[546] M. Vergassola, B. Dubrulle, U. Frisch and A. Noullez, Astron. Astrophys. 289 
(1994) 325. 

[547] E.T. Vishniac, Mon. Not. R. astr. Soc, 203 (1983) 345. 

[548] E.T. Vishniac, in: EW. Kolb, M.S. Turner, D. Lindley, K. Olive and D. Seckel 
(eds), Inner Space/ Outer Space (University of Chicago Press, Chicago, 1986) 
p. 190. 

[549] N. Vittorio and M.S. Turner, Astrophys. J., 316 (1987) 475. 

[550] S.V. Vogeley, M.J. Geller and J.P. Huchra, Astrophys. J., 382 (1991) 44. 

[551] S.V. Vogeley, C. Park, M.J. Geller and J.P. Huchra, Astrophys. J., 391 (1992) 
L5. 

[552] G. Voronoi, J. reine angew. Math. 134 (1908) 198. 

[553] M.S. Warren, P.J. Quinn, J.K. Salmon and W.H. Zurek, Astrophys. J., 399 
(1992) 405. 

[554] I. Wasserman, Astrophys. J., 248 (1981) 1. 

[555] G. Wegner, J.R. Thorstensen, M. J. Kurtz, M.J. Geller and J.P. Huchra, Astr. 
J., 100 (1990) 1405. 

[556] S. Weinberg, Phys. Rev. Lett, 59 (1987) 2607. 

[557] D.H. Weinberg and S. Cole, Mon. Not. R. astr. Soc, 259 (1992) 652. 



174 



[558] D.H. Weinberg, J. P. Ostriker and A. Dekel, Astrophys. J., 336 (1989) 9. 

[559] D.H. Weinberg and J.E. Gunn, Mon. Not. R. astr. Soc, 247 (1990) 260. 

[560] D H. Weinberg and J.E. Gunn, Astrophys. J., 352 (1990) L25. 

[561] D. Weistrop, P. Hintzen, R.C. Kennicutt, C. Liu, J. Lowenthal, K.P. Cheng, 
R. Oliverson, and B. Woodgate, Astrophys. J., 396 (1992) L23. 

[562] M.J. West, D.H. Weinberg and A. Dekel, Astrophys. J., 353 (1990) 329. 

[563] M. White, D. Scott and J. Silk, Ann. Rev. Astron. Astrophys., 32 (1994) 319. 

[564] S.D.M. White, Mon. Not. R. astr. Soc, 186 (1979) 145. 

[565] S.D.M. White and C.S. Frenk, Astrophys. J., 379 (1991), 52. 

[566] G.B. Whitham, Linear and Non-linear Waves, (Wiley, New York, 1974). 

[567] B.G. Williams, Ph.D. Thesis, University of Edinburgh, (1992). 

[568] B.G. Williams, J. A. Peacock and A.F. Heavens, Mon. Not. R. astr. Soc., 252 
(1991) 43. 

[569] B.G. Williams, A.F. Heavens, J. A. Peacock and S.F. Shandarin, Mon. Not. R. 
astr. Soc., 250 (1991) 458. 

[570] E.L. Wright, et al. Astrophys. J., 420 (1994) 1. 

[571] A. Yahil, in: V. Rubin and G.V. Coyne (eds), Large-scale Motions in the 
Universe (Princeton University Press, Princeton, 1988) p. 219. 

[572] S. Zaroubi and Y. Hoffman, Astrophys. J., 414 (1993) 20. 

[573] Ya.B. Zel'dovich, Astron. Zh., 41 (1964) 873. 

[574] Ya.B. Zel'dovich, Astr. Astrophys., 4 (1970) 84. 

[575] Ya.B. Zel'dovich, Mon. Not. R. astr. Soc, 160 (1972) IP. 

[576] Ya.B. Zel'dovich, Sov. Astron. Lett, 8 (1982) 102. 

[577] Ya B. Zel'dovich, J. Einasto and S.F. Shandarin, Nature, 300 (1982) 407. 

[578] Ya.B. Zel'dovich, A.V. Mamaev and S.F. Shandarin, Sov. Phys. Usp., 26 
(1983) 77. 

[579] Ya.B. Zel'dovich and I.D. Novikov, The Structure and Evolution of the 
Universe, (University of Chicago Press, Chicago, 1983). 

[580] Ya.B. Zel'dovich and S.F. Shandarin, Sov. Astron. Lett, 8 (1982) 67. 

[581] J.M. Ziman, Models of Disorder (Cambridge University Press, Cambridge, 
1979). 

[582] F. Zwicky, Morphological Astronomy (Springer, Berlin, 1957). 

[583] F. Zwicky, E. Herzog, P. Wild, M. Karpowicz and C.T. Kowal, Catalog of 
Galaxies and Clusters of Galaxies Vols. 1-6, (Pasadena: Calif. Inst. Technol. 
1961 - 1968). 



175 



